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Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -process ing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : A_Geneseq_19Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1980 .DAT 

2 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1981 . DAT 

3 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1982 .DAT 

4 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1983 .DAT 

5 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1984 .DAT 

6 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1985 .DAT 

7 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1986 .DAT 

8 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1987 .DAT 

9 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1988 .DAT 

10 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1989 . DAT 

11 : /SIDSl/gcgdata/geneseq/ geneseqp-embl/AA199 0 . DAT 

12 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1991 . DAT 

13 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1992 . DAT 

14 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1993 .DAT 

15 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1994 . DAT 

16 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1995 . DAT 

17 : / SIDSl/gcgdata/geneseq/ genes eqp-embl /AAl 996 . DAT 

18 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1997 .DAT 

19 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1998 . DAT 

2 0: /S I DS 1 /gcgdat a/genes eq/genes eqp - embl /AAl 999. DAT 

21 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2000 . DAT 

22 : / SIDSl/ gcgdat a /genes eq/ geneseqp - embl /AA2 001 . DAT 

23 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2002 .DAT 

24 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 .DAT 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 
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RESULT 1 
ABB81074 

ID ABB81074 standard; Protein; 1163 AA. 
XX 

AC ABB81074; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; ant i- tumour; antidiabetic; anticonvulsant ; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic ; 

KW osteopathic; vasotropic; nephrotropic ; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; receptor. 
XX 

OS Rattus norvegicus . 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2 00 1US- 08 9334 8 . 
XX 

PR 19-MAY-1998; 98IL-0124500 . 

PR 21-JUL-1998; 98WO-US14715 . 

PR 22-DEC-1998; 98US-0218277 . 

PR 19-MAY-1999; 9 9US- 03 14 161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 
XX 

PS Example 5; Page 44-47; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 
CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 



CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates ) , Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the rat 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen. 

XX 

SQ Sequence 1163 AA; 

Query Match 100.0%; Score 5848; DB 23;. Length 1163; 

Best Local Similarity 100.0%; Pred. No. l.le-296; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 6 0 

1 1 1 ! M ! 1 1 1 1 1 ! M ; 1 1 1 II I N M I: I 1 1 Ml ! 1 1 ! 1 1 1 1 1 II I II 1 1 1 1 1 Ml IM 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

MMMMMMMMIIMMMIMM IMIIMM MIMMIIMM Mill III 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

1 1 1 II Mi M 1 1 1 1 1 1 1 MM 1 1 IM I! 1 1 IMM II 1 1 1 1 1 1 1 1 1 II II 1 1 1 IM I M 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

1 1 1 1 1 1 M M 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 M MM 1 1 1 1 1 MM I IM 1 1 1 1 IM IM 

Db 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 3 00 

MMMIMMMIMM MUM MIMMIMI II I Mill IIIIIIIIIIMIMI 

Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Ml III 1 1 1 1 M 1 1 II 1 1 II I III I II II M I III 1 1 1 M Ml 1 1 Ml I M ;M 1 1 1 1 

Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 , 

Qy 361 PWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

IIIMMIIIIIIMIIIIIIMIIIIIMIIIIIIMMIIIIIIIIIIIIIIIIMII 

Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

Qy 421 NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

II M II 1 1 1 1 M 1 1 1 1 1 1 1 II I III I MM 1 1 1 IM 1 1 1 M Ml 1 1 Ml I M Ml II 1 1 

Db 421 NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 



IIIMMIIIIIIIIIIIIIIMIIIIIIIIMIMIIIIIMIIIIIIIIIMIMIII 

Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESE^ 54 0 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

M 1 1 1 1 1 II M M I M I M MM II 1 1 1 M 1 1 1 1 1 1 II MM I M II 1 1 III 1 1 1 1 1 ! 

Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Qy 601 LPSAGAS WQPS VS PLEAP PPVS YDS I KLE PENP P P YEEAMNVALKALGTKEG I KEPES F 660 

MIMMMIM III MMMMMIIMM MIMMIMM IIIIIIMIIMI! 

Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

IMMMIIi III MMMMMIIMM MIMMIMM MIIIIIIIMM 

Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

IMIIMIM III MMMMMMIMM MIMMMIM MIMIIIIIIIM 

Db 721 PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Qy 781 NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESETFSDSSPIE 84 0 

MIIIIMM MIMIIIIIMIIMIIM MiMMIMM IMMMIMM 

Db 781 NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESETFSDSSPIE 840 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 841 I IDEFPTFVSAPCDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 900 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

I I ; : i 1 I '< III I : ; I I : ! I I I '! I ; I I ; I I I , I I I I I M I ' I I I I I I I I I I I I I I I I 

Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

III MM MM 1 1 III MM MM III 1 1 III I II II I III I III Mill MM I Ml 1 1 

Db 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 



Qy 


1021 


VTISFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 

MIMmMII MMMMMIIMM MIMMIMM IIIMIIIIIIIMIMI 

VTISFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 


1080 


Db 


1021 


1080 


Qy 


1081 


FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 

MMIIMM MIMIMMMMM MMMMMIIMM MIMIMMM 

FLVDDL VDSLKFAVLMW VFT Y VGALFNGLTLL I LAL I SLFS I PVI YERHQVQI DHYLGLA 


1140 


Db 


1081 


1140 



Qy ■ 1141 NKS VKDAMAKI QAKI PGLKRKAD 1163 

I M 1 1 1 1 II M M I Ml M M 

Db 1141 NKS VKDAMAKI QAKI PGLKRKAD 1163 



RESULT 2 
AAY71310 

ID AAY71310 standard; Protein; 1163 AA. 
XX 

AC AAY71310; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo A. 



XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 

XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 1..171 

FT /note* "Inhibits NIH 3T3 fibroblast spreading" 

FT Modif ied-site 30 

FT /note= "Casein kinase II site" 

FT Region 31 . .58 

FT /ndte= "Acidic region" 

FT Region 31 . . 57 

FT /note= "Region specifically described in claim 16" 

FT Modif ied-site 233 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 242.. 244 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 291 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 295 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-dif f erence 404 

FT /note- "Encoded by TTG " 

FT Modif ied-site 436 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 468.. 470 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 484 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 488 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 502 

FT /note= "Casein kinase II site" 

FT Modif ied-site 1 576 

FT /note= "Casein kinase II site" 

FT Modif ied-site 626 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 694.. 696 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 715 

FT /note= "Casein kinase II site" 

FT Modif ied-site 784 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 821 

FT /note- "Protein kinase C (PKC) site" 

FT Modif ied-site 850 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 855 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 863 



FT /note^ "Casein kinase II site" 

FT Modif ied-site 868 

FT /note- "Protein kinase C (PKC) site" 

FT Modif ied-site 893 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 912.. 914 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 925.. 927 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 954 

FT /note- "PKC and casein kinase II sites" 

FT Modif ied-site 956 

FT /note= " PKC and casein kinase II sites" 

FT Domain 988 . . 1023 

FT / label = Transmembrane_domain 

,FT /note= "C- terminal hydrophobic region 

FT specifically described in claim 16" 

FT Modif ied-site 1024 

FT /note- "Protein kinase C (PKC) site" 

FT Modif ied-site 1071.. 1073 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 1073 

FT /note- "Protein kinase C (PKC) site" 

FT Modif ied-site 1089 

FT /note- "Protein kinase C (PKC) site" 

FT Domain 1090 . . 1125 

FT / label = Transmembrane_domain 

FT /note- "C- terminal hydrophobic region 

FT specifically described in claim 16" 

FT Modif ied-site 1141.. 1143 

FT /note- "Asn is N-glycosylated" 

FT Modif ied-site 1143 

FT /note- "Protein kinase C (PKC) site" 

FT Peptide 623 . . 640 

FT /note- "used as immunogen to generate antibody AS 472" 

FT Peptide 762 . . 1163 

FT /note- "used as immunogen to generate antibody AS Bruna" 

FT Inhibitory-site 542.. 722 

FT Region 172 . .259 

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Region 975 . . 1162 

FT /note- "This region is not essential for inhibitory 

FT " activity" 

FT Region 976. .1163 

FT /note= "C- terminal common region found in Nogo A, B and 

FT C isoforms" 

XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 01 07446 . 
XX 

PA (SCHW/) SCHWAB M E . 



PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 

DR N-PSDB; AAD01173 . 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 3; Fig 2A; 122pp ; English. 
XX 

CC The present sequence is a rat Nogo A protein which is a 

CC potent neural cell growth inhibitor and is free of all central nervous 

CC system (CNS) myelin material with which it is natively associated. The 

CC protein was derived from a cDNA generated by fusing R018U37-3, R1-3U21 

CC cDNAs isolated from hexanucleot ides -primed rat brain stem/spinal cord 

CC library, and 01il8 cDNA from an oligo d{T) -primed rat oligodendrocyte 

CC library. Nogo proteins and fragments displaying neurit e growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma , acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprolif erative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: The present sequence designated as SEQ ID NO: 2 is stated to 

CC be the same as the sequence shown in Fig. 13 (see AAY71384) of the 

CC specification. However, this sequence does not match the sequence given 

CC in Fig. 13. SEQ ID numbers 35-42 are referred in claim 32 and 

CC SEQ ID NO: 29 in disclosure of the specification. However, the 

CC specification does not include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 1163 AA; 

Query Match 100.0%; Score 5846; DB 21; Length 1163; 

Best Local Similarity 99.9%; Pred. No. 1.4e-296; 

Matches 1162; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I : il 1 1 1 1 1 1 ill MM l! II 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 !M M II 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

I : I I I I Ml I I I I I I Ml M ■ I :M I I Ml I IM 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPIAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

I Ml I U I I I I Ml MIM M ' MM I MMI II I I 



Db 



121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 



Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

II M 1 1 1 MMMM I Ml 1 1 1 M I MM 1 1 1 II I M 1 1 1 1 1 1 1 1 Ml Ml M I MMM 

Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

IIIIIMIIIIIIMIIIMMIIIMIIIIMMIMMIIIMIIIIIIIIIIIIIII 

Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 3 00 

Qy 3 01 AI LVENTKEE VI VRS KDKEDLVCSAALHS PQES P VGKEDRWS PEKTMD I FNEMQMS WA 360 

IIIIIMIIIMIMIIIliMIIIMIMIIMIIMIIIIIIIIIIIIIIMIIIIII 

Db 3 01 AILVENTKEEVI WSKI3 KEDLVCSAALHS PQES PVGKEDRWS PEKTMD I FNEMQMS WA 360 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

!l 1 1 Ml 1 1 1 M I M MM M II I Ml II Ml M M I M II Ml IM II I III II I II 

Db 3 61 PWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 420 

Qy 421 NEDAS F PSTPE PVKDS SRAY I TCAS FTSATESTTANTFPLLEDHTS ENKTDEKKI EERKA 480 

Mill I Mill III II IIIIIIIIIIIIIMMIIIIIilll IIMII III Mill MM 

Db 421 NEDASFPSTPEPVKDSSRAYI TCAS FTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 11 1 1 1 1 i ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

I f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Qy 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

M 1 1 MM 1 1 II I IM 1 1 1 1 M 1 1 Ml 1 1 M M 1 1 M MM I IM II 1 1 1 III 1 1 1 1 1 

Db 601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

MIIIIIIIIMIMIIMMIIIIIIIIIMIIMIIIIIIIMIIIIIIIIIIIIMI 

Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

1 1 1 1 ! 1 1 1 1 1 1 1 ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I E 1 1 1 1 1 1 1 1 1 1 ! 

Db 721 PVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Qy 781 NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

I i 1 1 1 i I f 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 E 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 781 NLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI E 840 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

! 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 841 1 1 DEF PTF VSAKDDS P KLAKE YTDLEVSDKS E I AN I QSGADSLPCLELPCDLS FKN I YP K 900 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

MM 1 1 1 1 1 1 M II I IM 1 1 1 1 1 1 1 MM M 1 1 M 1 1 1 1 M II I IM II I M Mill 1 1 1 

Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Qy 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLS 1020 

II M 1 1 1 M I IM 1 1 MM 1 1 M I MM M I II I M 1 1 1 M 1 1 MM M I M 1 1 1 MM 

Db 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLS 102 0 



Qy 1021 VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 1080 

I I I I ll I l!i I I I I I I II I I ! I I I I I I ! II I I I I II I II I II II I II I Ml I 

Db 1021 VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 1080 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 1140 

M M .: 1 1 1 1 1 1 III II 1 1 1 1 i I III Mi I II I II i III II I II Ml 1 1 1 II 1 1 

Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 1140 

Qy 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 

IIIIIIMIIIIIIIMIIIIII 
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 



RESULT 3 
AAY71557 

ID AAY71557 standard; Protein; 1162 AA. 
XX 

AC AAY71557; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A truncated protein used in the construction of mutant Nogo-A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol if erat ive disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp . 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 010744 6 . 
XX 

PA (SCHW/) SCHWAB M E. 
PA (CHEN/ ) CHEN MS. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Example; Page -; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of 
CC all central nervous system (CNS) myelin material with which it is 



CC natively associated. Nogo proteins and fragments displaying neurite 

CC growth inhibitory activity are used in the treatment of neoplastic 

CC disease of the CNS e.g. glioma, glioblastoma, medullobla stoma , 

CC craniopharyngioma, ependyoma, pinealoma, haemangioblastoma , acoustic 

CC neuroma, oligodendroglioma, menagioma, neuroblastoma or retinoblastoma 

CC and degenerative nerve diseases e.g. Alzheimer's and Parkinson's 

CC diseases. Therapeutics which promote Nogo activity can be used to treat 

CC or prevent hyperprolif erat ive or benign dysproliferative disorders e.g. 

CC psoriasis and tissue hypertrophy. Ribozymes or antisense Nogo nucleic 

CC acids can be used to inhibit production of Nogo protein to induce 

CC regeneration of neurons or to promote structural plasticity of the CNS 

CC in disorders where neurite growth, regeneration or maintenance are 

CC deficient or desired. The animal models can be used in diagnostic and 

CC screening methods for predisposition to disorders and to screen for or 

CC test molecules which can treat or prevent disorders or diseases of the 

CC CNS. The present sequence is a truncated form of rat Nogo A protein shown 

CC in AAY71310, which is used in the construction of mutant Nogo-A. Nogo-A 

CC is composed of His-tag/T7-tag/vector/Nogo-A sequence aa 1-1162. 

CC Nogo A deletion mutants were used for mapping the inhibitory sites of 

CC Nogo protein. Major inhibitory region was identified in the 

CC Nogo A sequence from amino acids 172-974, particularly amino acids 

CC 542-722. In addition, N-terminal region 1-171 was found to be inhibitory 

CC to NIH 3T3 fibroblast spreading. 

CC Note: The present sequence is not given in the specification but is 

CC derived from rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 

CC are referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers. 

XX 

SQ Sequence 1162 AA; 

Query Match 99.9%; Score 5840; DB 21; Length 1162; 

Best Local Similarity 99.9%; Pred. No. 2.8e-296; 

Matches 1161; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I Ml I I MM I II I I MM M M M l I M I I II I I I I I I I I I I I II I 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

I M I I MM I M M l I I Ml I I I M I I I I I IM I I I M I M I I I II I MM I I II I 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

Qy 121 PMAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 M 1 1 ,11 1 1 1 1 . 1 1 1 M I II I II II I II II 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

M 1 1 M 1 1 1 1 M 1 1 1 ; 1 1 Mil I IM 1 1 II I II 1 1 , I IMM 1 1 MUM II II 

Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVljLETAASLPSLSPLSTVSFKEHGYL 240 

Qy 241 GNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

I II 1 1 1 1 II 1 1 1 IN ■ I III 1 1 Ml Ml I M 1 1 1 1 I I II 1 1 1 1 1 !l 1 1 1 1 III 

Db 241 GNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 3 01 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

II 1 1 1 IM 1 1 M 1 1 1 1 I III 1 1 1 1 1 1 1 1 1 III M I M ■ 1 1 II II I II I MM I II I 



Db 



301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 



Qy 




D^rpppvAnptfDPPnAWPVP< r nT , vppcD , nA 7 T a APAMVPQi< r \7"n'PJ<rp , T pnQT ppivct PtmcppiP 


a 9 n 


JJD 


^ £ 1 
Jul 


MIMi MMMIMMMIMMMMMIMM II IIIIIMIIMIIMIIIIIII 

p^TT^ppvAnpT^pppnAWPVT^nTvpPcpnT.TT a ap amvpctvttt^p'k'pt pnQT ppiitqt pktiQppp 


a 9 n 


Pit T 


'X Z X 


IM JiiJ/ior rr o 1 ir Jiir V rvLOoKi-i. X 1 x LHor 1 oil IdoI X/\1M X r ir XjijijUrl 1 dcjJn i\.X JJCjJ\J\X CjCjKJ\i\ 


a ft n 


JJD 


T: Z X 


llllll MIMIIII'MMMMMMM IIIIIMII Illlllllllllllllllllll 

1M J-jU/tLiD r tr O 1 IT Hitr V IVlJOOJvrVX X X LiiDr X iD/\ X XiO X X^vLM X c tr XjXjXLUXI X O HXM iYX XJXjXViVX LjCjiviaJ-I 


A ft fi 


qy 


ART 


PIT T TPWTCPlfTQMPPT \7A VHHQP A nVT7T'T , 'n r T'T QT^ATTPA A VQTVTM DPPiT TDnT VPlPA PPCPT 
1 XoJ\XoirJ\lolNirr Xj Vii V^UoJZjH.JJ I V X 1 JJ 1 XjoJW 1 slu\i\ V oXNl/Jir XIiOXj X itXJXj Cirt.L-IljOiLXj 

MUM IIIIIMII MMMMMMMI lllllll llllll MMMM llllllll 1 


RAH 
D4 U 


UD 


/I Q 1 

4 o X 


AT T TCWTC DyTCMDfT T 7A T 7C\T\QT?7\ TlVT T T T"T I T*\T'T 0\Z\WX?T\ A T 7CMMPT?PT TDnT T 7f~\T? A PT? C T?T 
yXX lHjJXlbFJxl oiNlFr Xj VA V^JJociAJJ x V 1 X JJ IXjoJW 1 HjAA V oJNl v J JtHjoXj X JtJJXj VyiliAUJlrOCjXj 


c / n 


p, , 

Qy 


b4 1 


TvTD A TPTl/T A VDTTZT rP\T T JPHPCD A T PDCT VDTA PT PDCDDITA DA T'DC DT 7T TinT^ 7TVID A DT MOT 

JNKA.101 1x1 Ax Ji 1 JxVJJxjVy x biLAl ^hjoij i F lAyj-iLFox^ JiJiAJiA 1 FbrVLrUJ. VMhiArljJNblj 

1 Mill lllllllilllMIIMI IIMI llllll Ml IllllllllMIIII MM II 


bUu 


JJD 


CM 1 


A VCTITT 7"PiT \ 7PTCT? A T PT?CT VDTA OT PDCT?l?I?A TJ 1 A TD C DT7T DPlT A PT 1STCT 

JNIhiAlLrl JvIAYEjI JvVJJijVy 1 bJiAiyxijOXjY r lAtJijL-irbr JiJiAhiAl JroJr VLiPUl VFJKAr J_jJN!blj 


/" A A 

b UU 


Pit 7 

Qy 


oUl 


T PQAfS Q\A7PlPCUC! PT PA PPPVQVTlC! T I^T PPPMPPPVPP AMM\7A T V AT PTITPP 1 T yPPPCP 
Xj r o>I\{j}\ o V Vl^fJro V j r Liiiit\ir ir ir V o x XJo X rvLiJir^ililM ir r x Cj XIiAXmXN V/\XjJ\/\XjLi X JViSoX JvJiJr Ji«oJ^ 


/r /r a 
bb U 


JJD 


DUX 


1 Mill III ilMMMIMMMIMM Illlllllllllllllllllll II MM III 

T PCAPA C\ A7PlPC\J r CPT PA PPP\7CVnC!T VJ PPP1VTPPPVPPAMT\.T\7A T VAT PTTTPP T T^PPPQP 
JjroiiuiiD vvi^rovo tr XjJ1iH.it it tr V o I JJo X JMjJiir H>vt c xr tr X Ci rVHX v iXM ViA.XjjAjH.Xjo X JVJioX JXJiir ILor 


DDL) 


pit/ 

Qy 


^ £ i 
o b x 


MA A^^PlPTPA PVT CT A PnT T VTT?TW1 CTt?PQPnT7CMVCT?T AT^PPT^Q^rDPT-TZi PT \7PnCG PPCP 
Ni-vH. ill 1 Cirt.Jr x X o X>iUJJXjX r^Ht X J\Xjo 1 izjiroJrJJJ? olN x o 1 ±\ J\.r HiJ\.o V lr X1jJrl/\CjJ_j Vx^JJoo JtCjOCj 


/ z U 


JJD 


DDI 


llllll llllll IMMMMIMMIMI IMMilMMM llllllll IIIIIMII 

MA A'^TPiPT'PA PVT CT A PPlT T I/DTVT CTPDCDnPCMVCPT A l / T7Ul< r C\7'DT?UA PT UPnCCDCCl? 

NAAVyilj 1 jziAF x 1 b 1 ALJJJ_il jtvxIj 1 J\J_ib 1 JZjPbFJJr bJM xbiilAJA.r JiJVb vr JiHAJiJjVJiUbbr JibJi 


1 O A 

/z U 


Pit 7 

qy 


/ Z X 


P\/HT .PQnnQT PP^/PPlTPlPPA^/MT MVPQT .T'P\7QP r T 1 VAPn-II<TPPPT .Q A QPPlPT PiVPVT PCPPiD 
trvULic oJJJJo X rr Ji V tr^) H^CjEjjH, Vl v JXjl v JJ\JibXj 1 a VOCj X ViHvJiJA-Ji- t!j -t^ J - J bAb r^&Lt\j r^tr 1 liHor ^ Jr 

MUM IMIMIII MIIMIIIIMIII Mill M MINI llllllll llllll II 1 


7 ft n 


JJD 


TO 1 


F VJJijr bJJJJb 1 ircj V Fy vl v lj_ilYlKhjbJ-jl b VbJi 1 VAyJrlKJiJiKijbAb FybijoKF Y XiJib J^ yF 


TQ A 

/ o U 


Pit 7 

qy 


/ o X 


TJT UQTl^HA A QNTFlT DTT r r'W"E^PT^T CT PiMPPpTaT 1 A TVCMnHT T CCT^PniTT VPCPTPCTiC C P T P 
LM J_jiXO 1 J^XJiHJ-lLDLNI JJX ir X Xj X J\J\.JiJ\.X o Xj^l v JJi Hitr i\l X-H.1 X oXNXJiJXjijob J\.JZj1JJ\.X J\.i1jO Hi 1 r o JJbbF 1 Hi 


O 4 U 


UJJ 


"7 Q 1 


1 Mill IIIMIIMIIMIIMI IIMI llllll Ml llllll lllllll M MM III 

"NTT UCT^nB A CMnT PTT TVVWVT CT AMPPPMTA TVCwnnT T CCVPT^tTT ITIT 1 CPTfCnC C P T T? 
LM J_irlo 1 jAXJAAbJNJJX tr 1 Xjx J\J\JlJ\X b J_i YlJl Jl J^ i\ 1 AX x bXM JJJJXjXjbbJA.JZjJJJ\.l JxiLbJi I r oUz>C>ir 1 Ji 


C) 4 U 


Pit 7 

Qy 


Q A 1 
O 4 X 


T T HiCC DTTTT TO A 1/nnCDVT A VT?VTnT DT fCnVCDT ATvTTPiCPATiCT D PT T?T DP"PiT OTT/"KTT VD V 

1 lUbr Fir V bAJ\UUbF JvLAKJix I JJxjJi VbUKbhilAxNJ l^boAUbijFUj-ibJjFLJJijbr ivN 1 x FK 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II II II II 1 1 1 


Q A A 


JJD 


o 4 1 


1 1 DhjJ? Fib VbAl\JJUbFJ\±iAJvriX lULih, VbDKbhlAJNI 1 yboADbxjFLljbJjFLJJljbr J\JM i 1 FK 


Q A A 

y uu 


Pit 7 

Qy 


qni 

y u x 


PlPT 7T-TT 7CP1PPCPMP CCT 7CT/A C T C DCMT 7C A T U 1 DPiTTTMP C T T 7VCVCT T'T/'T? A T?L r L r T T) C r^TTJ 1 VCR 

JJcj V rl VbJJJir btJNKbb Vb J\Ab I bFbJM VbALiJiFy 1 JiFlob 1 V Kbivb-Lil KhjAbivlvLFbul JiJvhiU 


y o u 


1JD 




llllll IIIIMII IIMIIIIIIIIIIMIIIil II llllll IIIIMII MMM II 1 

PiPT 7T-JT 7CPIPPCPMD C CT 7Clf A C T C DCMT 7C AT C DPlTT?MP C T T 7 TV C V CT T'T/'T? A rT/T/T D C RTTVUfl 

JJtli VJi.\/bJJJiJr bJiiMKbb vbi\Ab-L bFblNI VbAJ-ibFy 1 blVlLibl VKbr\bJ_i 1 KiliAhJ\jMjFbUl JiKbU 


y bu 


PU 7 

Qy 


y o i 


DCT CAT7T CADT CT/ r PCT 7T 7T~\T T VTnTn T~\ T TVL rr P/~ 1 T 7T 7'D/*^ ACT T7>T TT CT <"PT TTT? C T T TCT TT» A VT TIT AT T C 

KbijbAVJjbAbJjbKl b V VUJ_iJ_iY WKJJ1 KKluv VbCjAbljr JjliljbJLl VJ^ b 1 Vb V 1AY lAijAljXjb 


T A O A 

1 Uz U 


UD 


yo l 


llllll IMIMIII MIIMIIMMIII IIIMM 1 1 II 1 II II 1 1 M 1 1 1 llllll 

T)CT CAtTT P7\ DT O l/TPT TT TT\T T tTT.TTl T~ \ T Tv'T/TP/^T TT 7T - I A AfT T - 'T TT CT rjiT tt-i pTTiPT TrpTi VTUT TVTT f 

R b Lb A VL b AEL b KTb VVDLL YWRD I KKTG VVFG 


1020 


Qy 


1 Uz 1 


T Trp T C CD T VTVPT TT P A T rM/CTMI'PU'nDD A T7T D C DT 7 A T C DDT T 7/~M/"VC"KT O 7\ T TJT TjvT O T 1 T T/dt T~J "D T 

VI lbrKl YKoViyAiyKbJJbbrlFr KA x Lbbh VA1 bbbxjVyKY bNbALOHVNb 1 IKhLKRL 

II 1 1 1 1 M i 1 1 ! 1 1 1 1 1 1 1 1 ll! 1 1 1 II 1 M:IM I 1 II 1 1 1 ! 1 1 M 1 1 1 . 1 1 


1 A O A 

I0B 0 


Db 


1021 


VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 


1080 


Qy 


1081 


FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFS I PVI YERHQVQ I DHYLGLA 


1140 


Db 


1081 


II Mi 1 1 1 1 1 1 1 1 Ml IM M 1 1 1 1 II 1 1 1 1 III 1 1 1 II III 1 1 1 1 1 1 1 1 1 i , 

FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL I LAL I SLFS I PVI YERHQVQ I DHYLGLA 


1140 


Qy 


1141 


NKSVKDAMAKIQAKI PGLKRKA 1162 




Db 


1141 


III 1 1 1 1 Ml 1 M 1 M 1 1 1 1 

NKSVKDAMAKIQAKI PGLKRKA 1162 





RESULT 4 




AAY71384 




ID 


AAY71384 standard; Protein; 1163 AA. 


XX 






AC 


AAY71384; 




XX 






DT 


02-NOV-2000 {first entry) 


XX 






DE 


Alternative version of rat neurite growth inhibitor Nogo A. 


XX 






KW 


Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 


KW 


central nervous 


system; neoplastic disease; antiproliferative; glioma; 


KW 


antisense gene 


therapy; neuroblastoma; menagioma; retinoblastoma; 


KW 


degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 


KW 


hyperprol iterative disorder; benign dysprolif erative disorder; diagnos 


KW 


psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 


KW 


structural plasticity; screening. 


XX 






OS 


Rattus sp. 




XX 






FH 


Key 


Location/Qualifiers 


FT 


Inhibitory- site 


1 . . 171 


FT 




/note= "Inhibits NIH 3T3 fibroblast spreading" 


FT 


Modif ied-site 


30 


FT 




/note= "Casein kinase II site" 


FT 


Region 


31. .58 


FT 




/note= "Acidic region" 


FT 


Modif ied-site 


233 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


242 . .244 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modif ied-site 


291 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


295 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


436 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


468 . .470 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modif ied-site 


484 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


488 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


502 


FT 




/note= "Casein kinase II site" 


FT 


Modif ied-site 


576 


FT 




/note= "Casein kinase II site" 


FT 


Modif ied-site 


626 


FT 




/note= "Protein kinase C (PKC) site" 


FT 


Modif ied-site 


694 . .696 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modif ied-site 


715 


FT 




/note= "Casein kinase II site" 


FT 


Modif ied-site 


784 


FT 




/note= "Protein kinase C (PKC) site" 



FT Modified- site 821 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 850 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 855 

FT /note- "Protein kinase C (PKC) site" 

FT Modif ied-site 863 

FT /note= "Casein kinase II site" 

FT Modif ied-site 8 68 

FT /note= "Protein kinase C ( PKC) site" 

FT Modif ied-site 8 93 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 912.. 914 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 925.. 927 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 954 

FT /note= "PKC and casein kinase II sites" 

FT Modif ied-site 956 

FT /note= "PKC and casein kinase II sites" 

FT Domain 988 . . 1023 

FT /label = Transmembrane_domain 

FT /note= "C- terminal hydrophobic region" 

FT Modif ied-site 1024 

FT /note- "Protein kinase C (PKC) site" 

FT Modif ied-site 1071.. 1073 

FT /note= "Asn is N-glycosylated" 

FT Modif ied-site 1073 

FT /note= "Protein kinase C (PKC) site" 

FT Modif ied-site 108 9 

FT /note= "Protein kinase C (PKC) site" 

FT Domain 1090. . 1125 

FT /label = Transmembrane_domain 

FT /note= "C- terminal hydrophobic region" 

FT Modif ied-site 1141.. 1143 

FT /note- "Asn is N-glycosylated" 

FT Modif ied-site 1143 

FT /note= "Protein kinase C (PKC) site" 

FT Peptide 623.. 640 

FT /note= "used as immunogen to generate antibody AS 472" 

FT Peptide 762.. 1163 

FT /note= "used as immunogen to generate antibody AS Bruna" 

FT Inhibitory-site 542.. 722 

FT Region 172. .259 

FT /note- "This region is not essential for inhibitory 

FT activity" 

FT Region 975 . . 1162 

FT /note- "This region is not essential for inhibitory 

FT activity" 

FT Region 976.. 1163 

FT /note- "C- terminal common region found in Nogo A # B and 

FT C isoforms" 

FT Misc-dif ference 223 

FT /label= Unknown 

FT /note- "There is Leu at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif ference 4 04 



FT /note= "There is lie at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif f erence 469 

FT /labels Unknown 

FT /note= "There is Lys at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif f erence 661 

FT /note= "There is Asn at this position in the 

FT sequence shown in AAY71310" 

FT Misc-dif f erence 820 

FT /note= "There is Leu at this position in the 

FT sequence shown in AAY71310" 

XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 0107446 . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S - 
XX 

PI Schwab ME , Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 3; Fig 13; 122pp; English. 
XX 

CC The present sequence is an alternative version of rat Nogo A protein 

CC which is a potent neural cell growth inhibitor and is free of all 

CC central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying 

CC neurite growth inhibitory activity are used in the 

CC treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma , acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprol iterative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: The present sequence is an alternative version of the 

CC Nogo A sequence shown in Fig. 2A (see AAY71310) . 

CC SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 



CC include sequences for these SEQ ID numbers. 
XX 

SQ Sequence 1163 AA; 



Query Match 99.6%; Score 5823; DB 21; Length 1163; 

Best Local Similarity 99.7%; Pred. No. 2.1e-295; 

Matches 1159; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

II! II 1 IIMI I IMIIIillllllllill Mill! M llllll lllll III IIMIII 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 PAA.GLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

I I I I M I I I I : M; I I I MMMMM MM 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIII 1 1 1 1 1 1 1 1 1 1 1 II 

Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASXPSLSPLSTVSFKEHGYL 24 0 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 3 00 

I M I M M I II J I M M MMM i MM 

Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 3 00 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

MIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIMMIIIIIIIMIIIIIIIIIIIII 

Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Qy 361 PVREEYADFKPFEQAWEVKI)TYEGSRDV1AARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Db 361 PWEEYADFKPFEQAWEVKDTYEGSRDV1AARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

QY 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

M II II I I MM I MM I II III Ml I 

Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENXTDEKKIEERKA 480 

Qy 481 QI I TEKTS PKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 54 0 

I 1 1 1 II II; 1 1 1 il ! I I ' II I II Ml 1 1 ,. 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1! 1 1 1 M 

Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 54 0 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI VMEAPLNSL 600 

: ! 1 1 In 1 1 1 1 ; h I III I! I II 1 1; N I 1 1 1 1 M 1 1! I ll II 1 1 1 M 1 1 1 ! II 

Db 541 N EATGTKIAYETKVDLVQTSEA I QESLYPTAQLCPSFEEAEATPSPVLPD I VMEAPLNSL 600 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

MMM MMM MM MM' MM IM M II IM 

Db 601 LPSAGAS WQPS VSPLEAPPPVSYDSI KLEPENPPPYEEAMNVALKALGTKEGI KEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

■ 1 1 M II II 1 1 1 li ., I II I II I II III I II 1 1 1 1 1 1 II 1 1 1 1 , 1 1 1 II 1 1 MM 

Db 661 MAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 



I i I I U I I I I I I'l M U MM I IN I I 

PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 



Db 


721 


PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 


780 


Qy 


781 


NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESETFSDSSPIE 
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 


840 


Db 


781 


1 1 1 1 1 M 1 II 1 ! 1 1 1 1 i 1 1 1 M 1 1 1 1 1 1 1 1 I M 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 

NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLSSSKEDKIKESETFSDSSPIE 


840 


Qy 


841 


IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 


900 


Db 


841 


1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 


900 


Qy 


901 


DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


960 


Db 


901 


1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 M i 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 
DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 


960 


Qy 


961 


RSLSAVLSAELSKTSVVDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLS 
1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 II 1 1 1 M 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1020 


Db 


961 


M 1 1 1 M 1 M 1 ! 1 1 1 1 ! 1 1 M II 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ! 1 

RSLSAVLSAELSKTSVVDLLYWRD I KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLS 


102 0 


Qy 


1021 


VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 
1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 


1080 


Db 


1021 


1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M II 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 M 1 

VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 


1080 


Qy 


1081 


FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQ I DHYLGLA 


1140 


Db 


1081 


IIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIMIIIIIMIIIIIIMIIIIIIIII 

FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQ I DHYLGLA 


1140 


Qy 


1141 


NKSVKDAMAKIQAKI PGLKRKAD 1163 




Db 


1141 


1 Mill III MINI II MM II 

NKSVKDAMAKIQAKI PGLKRKAD 1163 





RESULT 5 
AAY71560 



ID AAY71560 standard; Protein; 974 AA. 
XX 

AC AAY71560; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE 1 Rat Nogo A protein fragment used in the construction of mutant NiAext . 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central v nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprol if erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 



XX 

PR 06-NOV-1998; 98US-0107446 . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Example; Page -; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of 

CC all central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying neurite 

CC growth inhibitory activity are used in the treatment of neoplastic 

CC disease of the CNS e.g. glioma, glioblastoma, medulloblastoma , 

CC craniopharyngioma, ependyoma, pinealoma, haemangioblastoma , acoustic 

CC neuroma, oligodendroglioma, menagioma, neuroblastoma or retinoblastoma 

CC and degenerative nerve diseases e.g. Alzheimer's and Parkinson's 

CC diseases. Therapeutics which promote Nogo activity can be used to treat 

CC or prevent hyperprolif erat ive or benign dysprol iterative disorders e.g. 

CC psoriasis and tissue hypertrophy. Ribozymes or antisense Nogo nucleic 

CC acids can be used to inhibit production of Nogo protein to induce 

CC regeneration of neurons or to promote structural plasticity of the CNS 

CC in disorders where neurite growth, regeneration or maintenance are 

CC deficient or desired. The animal models can be used in diagnostic and 

CC screening methods for predisposition to disorders and to screen for or 

CC test molecules which can treat or prevent disorders or diseases of the 

CC CNS. The present sequence is a fragment of rat Nogo A protein shown in 

CC AAY71310, which is used in the construction of mutant NiAext. The mutant 

CC is composed of His-tag/T7-tag/vector/Nogo-A sequence aa l-974/T7-tag. 

CC Nogo A deletion mutants were used for mapping the inhibitory sites of 

CC Nogo protein. Major inhibitory region was identified in the 

CC Nogo A sequence from amino acids 172-974, particularly amino acids 

CC 542-722. In addition, N-terminal region 1-171 was found to be inhibitory 

CC to NIH 3T3 fibroblast spreading. 

CC Note: The present sequence is not given in the specification but is 

CC derived from rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 

CC are referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers. 

XX 

SQ Sequence 974 AA; 



Query Match 84.1%; Score 4921; DB 21; Length 974; 

Best Local Similarity 99.9%; Pred. No. 1.9e-248; 

Matches 973; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

1 1 1 I hi! ! 1 1 II ' 1 1 III 1 1 1 1 ! II I II I II I 1 1 1 1 1 li 1 1 1 II 1 1! II I II I I 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 



Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

llllll Ml III IIIIIEI I MMIII jlllMlillllMIIIII llllllllllil I 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

MIMIII MIMIIMM MMIIMM MMIMIMM IMMIIIIII I 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

IMIIIIII MMIIMM IMIMIMI MIMMIMM MIMIMMMM 

Db 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

IIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIII 

Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 301 AILVENTKEEVI VRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

IIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIMIMIIIIM 

Db 301 AILVENTKEEVI VRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDI FNEMQMSWA 360 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

I I I I ' I , I: I I I I : I IM h : I M hi 

Db 3 61 PWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQKSLGKDSEGR 42 0 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

IMIIMI IMIMIMI MMIIMM MIMMMIM IMMIIIIIMIII 

Db 421 NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

IMIIIIIIMIMIIIMIIMIIIIIIIIIIIIIIIIIIIMIIIMMIIIIIIIM 

Db 541 NEATGTKI AYETKVDLVQTS EAI QESLYPTAQLCPS FEEAEATPS P VLPD I VMEAPLNSL 600 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

liMIIMMIMIIIIIIIIIIIIillllMIIIIIIIIIIIIIMMMIIIIMIII 

Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

MIIIIIIIIIIMIIIMIIMIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 721 PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

IIIIMIMIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIMIIIIII 

Db 721 PVDLFSDDSI PEVPQTQEEAVNLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 78 0 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESETFSDSSPIE 840 

MMM MillMIMI IIM'liilM lilllllllli JIIIIIIIM MM 

Db 781 NLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSS PI E 840 

Qy 841 I IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 900 

IMM MMMMIMI IMMMMIM IMMIMM MIMMMMI Mill 

Db 841 I IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 900 



Qy 



901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSI VKSKSLTKEAEKKLPSDTEKED 96 0 



IIIIMI . llilllllMMIIIIIIIIIIilllMIIIMM , Illlllll 

Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 



Qy 961 RSLSAVLSAELSKT 974 

IIIIIIIIIMIII 
Db 961 RSLSAVLSAELSKT 974 



RESULT 6 
AAU04591 

ID AAU04591 standard; Protein; 1192 AA. 
XX 

AC AAU04591; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein ; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 

KW encephalomyelitis ; multifocal leukoencephalopathy; panencephalitis ; 

KW Marchiaf ava-Bignami disease; pontine myelinolysis ; adrenoleukodys trophy; 

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 

KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe ' s disease. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Domain 1054.. 1119 

FT /label= Lumenal_extracellular_domain 

FT /note= "This sequence is specifically claimed" 

FT Peptide 1055 . . 1094 

FT /label= Pepl 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1064 . . 1088 

FT /label= Pep2 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1074.. 1098 

FT /label= Pep3 

FT /note- "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1084 . . 1108 

FT /label- Pep4 

FT /note- "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1095 . . 1119 

FT s /label- Pep5 

FT /note- "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

XX 



PN WO200151520-A2 . 
XX 

PD 19-JUL-2001. 



XX 

PF 12-JAN-2001; 2 00 1WO-US0104 1 . 
XX 

PR 12-JAN-2000; 2000US-0175707 . 

PR 26-MAY-2000; 2 000US-0207366 . 

PR 29-SEP-2000; 2 000US-0236378 . 
XX 

PA (UYYA ) UNIV YALE . 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR N-PSDB; AAS09453 . 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders 
XX 

PS Example 1; Page 101-104; 109pp ; English. 
XX 

CC The sequence is the human Nogo protein, a 250kDa myel in-associated axon 

CC growth inhibitor. The invention relates to the use of the nogo receptor, 

CC nogo protein, their nucleic acids, vectors expressing them and antibodies 

CC against them, to isolate agents which block nogo receptor mediated axonal 

CC growth. The agent is useful for treating a central nervous system 

CC disorder which is a result of cranial or cerebral trauma, spinal cord 

CC injury, stroke or a demyelinat ing disease selected from multiple 

CC sclerosis, monophasis demyelinat ion, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, Marchiaf ava-Bignami disease, 

CC pontine myelinolysis , adrenoleukodys trophy, Pelizaeus-Merzbacher disease, 

CC Spongy degeneration, Alexander's disease, Canavan's disease, 

CC metachromatic leukodystrophy, viral infection and Krabbe ' s disease. 

XX 

SQ Sequence 1192 AA; 



Query Match * 75.3%; Score 4403.5; DB 22; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 2.5e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Mhlll Mill IIIIII MINIM IIIIII 1 1 1 1 1 = j I = = I [ ! 1 11 1 1 1 1 1 1 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP - - PAAAAPLLDFSSDS VPPAPRGPLPAAPPAAPERQPSWERS PAA P 115 

IIIIIIII II III llhll U MUM IIIIIIII: || : | 

Db 59 PAAGLSAAP VPTAPAAGAPLMDFGNDF VP PAPRGPLPAAP P VAPERQPSWDPS P VSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Ml MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f II II IIIMIIIIIMI 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II 1 1 1 , 1 1 1 J 1 1 1 1 1 [ 1 1 ! i I MM Ml UMImHMM MIMIMIIIII 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 



Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDIiAEFSELEY 285 

Mill Mill IMMI I - M I " : j ::|||||: M -Ml MIIMI 



Db 



238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 



Ov 

vy 


286 


^EMO^RKO^PKnESATTiVENTKFEVTVP^KDKRD T.VP^A AT .R^POFClp vH- KED 


33 9 


Db 


298 


llllll III MhM 1 :|Mh:|hh 1 : Ih 1 I III 

^RMO^F^V^PKAECJAVT VANPRERT TVKMKDPFFRT A/^NWT T iHKTOnFT.PTATiTKT A/PCRD 

O CjI'Iuu O IT O v »_> x lulLiun v J. V^-vLM l XvJZjIlt X X v I\XN IvXV JZj Hi Hi IVJ_I V OlMLN J. XJfllN V^^HiXj x X r\l-l X IVXJ V iVXjL/ 


357 


Ov 

vy 


340 


RWSPEKTMDIFNEMOMSVVAPVREEYADFKPFFOAWRVKnTYRO^RnW.AA RANV 

IV v V iJt XJ IV X 1 1X-* X J. i>l J— 11 •I'y'l IhJ V V ±x L V IVJ—I Xj A C^i-J L 1 VX L J_i^^^i.yi l_l V 1VL/ J. X J_J\J O IV X/ V XXtixi. IVTiiil V 

III II 1 III ^ 1 IhlMlllllllh llllh 1 1 hill \- 


3 95 


Db 


358 


EVVSSEKAKIOSFNEKjRVAVEAPMREEYADPK^ 

xj v v i_i J— i i. am xj iu\. v n v un i. i iivxj i— i x n.\—J \. i\.r i. i_jiv v rv J— i v ivi_/ 1_) I vili XJ i_) i j i *i i i a-\ i-\ t h ti\ i Xj xj 


4 1 fi 

rt J- (J 


Ov 


396 


ESKVDRKCLEDSLEOKSLGKDSEGRNEDASFPSTPEPVKDS^RAYTTOA^RT-^ATR^TT 


454 


Db 


417 


lllhll Mill : MM hi lllllll Ml 1 llllll 1 Mill 

ESKVDKKrFADSLEOTNHEKDSESSNDDT^FP^TPEGTKDR^GAYTTrAPFMPAATR^T A 

XJ hJ X V V J_* X VI V V-f X xi.X^ i^J XJXJN^/ X X* 1 1U I VIVlJ XJ l^/ lJ X>J L/L/ J. |J X XT t_? _J_ 17 X_JV_J _L X VU jTV u-J V_J fT X J. X VJlt X/ LM XT Xi/i X i—J J- Xt 




vy 


4 55 


AMTRPTJiRDHT^RNKTHRKK'T RRRKAOT TTRK-T^PKT^MPRT,\7A\70nc;RAnV\7TTnTT.C! 

xtXM I r r XJXJCjLjTI X O L_»1M IV X XJCj rvrvX Hi rjXvlvrt.^X X X J_i iv _L O Jr XV J-Oiilrr Jj Vfi VUUiD ".Ml j X v 1 ± LJ ± Xjo 


jij 


Db 


477 


1 MM 1 ..MINI II III MM llllll h 

TNIFPLLGDPT^ENKTDEKKIEEKKAOI VTEKj^TT^TKT^NPFTiVAAOD^RTnYVTTnNT.T 

X J-V X J. 1_ XJXJA — 1 1_> X 1 UUINXVI XVUI VIVX XJ i—i I VI VTiVj,/ X V X I— 1 1 \XM 1J I IV 1 UlM T 1 XJ V J^CW^LJtZ} 1— i X XJ X V X X XJX4 XJ X 


-J o o 


Ov 
vy 


514 


KVTEAAV^NMPFGT .TPOT .VORAOR^RT .NRATOTKT A YRTKVDT .VOT^RA T DPQT .YPTAOT , 

iv v x i—irtJ-v voiMur DUUiruuv v K /J_Lrt.\ — Cj tZ) t-iXJlv J_ixx X vjj X IvX ri X I_i X rv v i_/xJ V ^ X O Cirt X Vj^/LjO Xj I r I ^vv^/Xj 

MM Mlllhllhll II llllll II III III hill llll hlllll Ml 


CTT 

3 / J 


Db 


537 


KVTEEWANMPRGT iTPDT A/ORAOR^RT .NRVTGTKT A YRTKMnT .VOT^RVMOR^ T .YPA AOT . 

IVV X i—i l_i V V xlXiJ 11 X Uv_JXJ X X X/XJ V ^ ' ■ n v i—iO 1_jXJLV Hi V X v_J X IVX rt. X Xj X I \ 1 1 1 ■/ 1 I V W X O X-i v 1 IvLiU XJ X r/iAy 1 1 




Ov 
vy 


574 


OPSFEEAEATP^PVTiPDT VMEAPTiN^TiT.P^AGA^^/^OP^V^PT^RA PPP^/^YDCIT KT.RPRN 

i tji LiLinijn x x. i_j x v xjx xv x v i ji_in t xjxm ijuur ijiiviiiiJ v v ^ r j v u r i 1 1 » /-\ r* r r v j 1 1— 'O x ivxji_i l xjXj 


^ 

UJ J 


Db 


597 


llllhlll llhllhll II llll MlllllhIM Mill hhlll llll 

OPSFEE^EATP^PVTiPDT WIRAPT,N^AVP^AOA^VTOP^^^PT.RA -^^WYR^T KHRPRM 


Ccr 

D 3 3 


Ov 
vy 


634 


PPPYEEAMNVATiKATi-OTKROT KRPR^RMAAVORTRAPYT ^ T APDT.T KRTKT.CJTRP^PDR 

X X X x i_i Xxrvi'llM v -rt. J_i ivri J_i \j X IvHjVJT X XVCj l do IT XJ-rtjri. V X r>rA tr JL X O X 2t.v_ J_/XJ X XvCj x XvXJiZ) 1 DrO r J_/x 

Mlllllhhll : 1 II llllh MM III hhlllh hlllll Ihlll 




Db 


656 


PPPYRRAM^V^TiKKV^OT KRRT KRPRMTMA AT .ORTRAPYT AOm.T KPTKT.Q APPA PnR 


/ Id 


Ov 
vy 


693 


^NY^RT AKFRK c !VPRHART t VFr) c J c: IPR c ^RP\7TjT.R c ^nnc! T PRVPOTORRA\7TVFT .MKPQT TR\7Q 

LjlM IQDI rt.IVir Hi IVD V x OIXXT. J_i XJ V i-jJ^OOxXjOIliir V L/XJ X OL/JJu 1 x Xj V x^i. w PiH V i v l Xji v l IVJ_J J Xj 1 P. Vfl 




Db 


716 


hllhll h IhhIMMIIhllllllllllllhlll hi llhllllll 1 

^HY^PMAKVROPVPnWQRT A/RnQQPnQRP^TTiT.RQnnClT Pn\n3Ht<rnnP'T'\71VTT \7VWQJ TPTC! 


1 1 3 


Ov 
vy 




-RTVAOWK'-RRRT.QA < ^PnRT.nT< r PVT,RCIpnPMT WQT'k'riA A CMm DTT TKWWT CT flMPfP 
J-i x vrt.ynx\. xjxjXvXjiZ3/-iiZ) r ^LjxjoiVir I xjiLor ^ir iNJxjllo 1 JVU/\— /\olNIUl r ILi ivivxirvX j±j\JyilliCiC 


Q fi Q 

buy 


Db 


776 


h: MMM 1 1 IIIMMh M MM ^ Ihlllll Mill 

FESMIEYENKEKLSALPPEGGKPYLE^RKTt^TiDNTKDTT ■T,PTjRV c ;tT. c ;kKRK'T PT.OMRRT, 

i- LikJiiiui XjI_m I \.i_il\JJiJrlXI t X XjVJVJlVt X XJIiikJ I IVXJO XJX>L>I X iuJ X XlUrLTi VO lUu iviv£jXVX L i_l 1 i_» J_i J_l 


ft ^ R 


Ov 
vy 


810 


i-M IAI X OIvX^IJxjXjOOIvIIjJ-JivX IVJZjiD I-i X X jL/jj x X Hi X X JJHiT x 1 X V Oi^AXJIJO IT xvJ-xrt XV Xj X 1 1 J 1 i r* VO 


Q ^ Q 


Db 


836 


MhllllM III MIIIM MM II Ml 

c JTA\7Y c IMr)nT,RT QPTRAOT P PTPTRQTiQQ P T RT T HPPPTT T C CT^TFiCPCT^T 7S. D PVTFiT PVC 


OJJ 


Ov 
vy 


8 69 


DKSEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNV 


yzo 


Db 


O -7 D 


llllll II MM III III III II Ihlhl h II = Ml 

il ivO Lj X JxiM Jx x L»\jrlU O Xj x ^ X £jXjx Il]_JXJOXJivL>J X yriVv HiILlvX Jf O LJUx lD JVtNJ LjOjTj. 1 O IVV J_iJ_iJ_ilr x LJ V 


y 33 


Ov 
vy 


927 


QAT .RPOTPMOQ TURCl^T .TT<TPA RP^k'T .PCinTP^PrfP QT QA\7T CAPT Cl^TC'\A7PlT T VTaTDFIT V 
O^xjCj ir ^ X Hii\lvjiO X V IvOivOXj 1 J^JZi^J^rviVLJir OL/ 1 Xj rviliiJJXOxjOjHL V XjOjtIiIixjo iv 1 O V V xJxjxj I WrvUx IV 

III 1 h llll 1 1 1 1 1 1 1 1 II i ■ 1 1 1 Ih IIIIIIIIIIIIIIIIMI 


yob 


Db 


956 


SALATQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTS WDLLYWRDI K 


1015 


Qy 


987 


KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 

IIIMIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIII 

KTGWFGASLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 


1046 


Db 


1016 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 


Db 


1076 


MUM IIIIIIMIII MM; 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 



Qy 1107 NGLTLLI LALI SLFS I PVI YERHQVQ I DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

I I I M I I II I I I i h I ! I M! I I il I I I I I h I I Mill I llll I I I h 

Db 1136 NGLTLLI LALI SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 7 




ABP68600 




ID 


ABP68600 standard; Protein; 1192 AA. 


XX 






AC 


ABP68600; 




XX 






DT 


14-JAN-2003 


(first entry) 


XX 






DE 


Human pancreatic cancer expressed protein SEQ ID NO 71. 


XX 






KW 


Human; pancreas; cancer; gene therapy; vaccine; immunost imulant ; 


KW 


cytostatic; 


tumour . 


XX 






OS 


Homo sapiens. 


XX 






PN 


WO200260317- 


-A2. 


XX 






PD 


08-AUG-2002 




XX 






PF 


30-JAN-2002; 2002WO-US02781 . 


XX 






PR 


30-JAN-2001 


• 2001US-265305P. 


PR 


31-JAN-2001 


* 2001US-265682P. 


PR 


09-FEB-2001 


• 2001US-267568P. 


PR 


21-MAR-2001 


• 2001US-278651P. 


PR 


28-APR-2001 


■ 2001US-287112P. 


PR 


16-MAY-2001 


■ 2001US-291631P. 


PR 


12-JUL-2001 


■ 2001US-305484P. 


PR 


20-AUG-2001 


• 2001US-313999P. 


PR 


27-NOV-2001 


• 2001US-333626P. 


XX 






PA 


(CORI-) CORIXA CORP. 


XX 






PI 


Benson DR, 


Kalos MD, Lodes MJ, Persing DH, Hepler WT, Jiang Y; 


XX 






DR 


WPI; 2002-627435/67. 


DR 


N-PSDB; ABV94680. 


XX 






PT 


New isolated polynucleotide and pancreatic tumor polypeptides, useful 


PT 


for diagnosing, preventing and/or treating cancer, particularly 


PT 


pancreatic cancer 


XX 






PS 


Claim 2; SEQ ID NO 71; 300pp + Sequence Listing; English. 


XX 






CC 


The invention relates to an isolated polynucleotide (I) comprising: (a) 


CC 


any of a group of over 4000 nucleotide sequences (ABV94 628-ABV9914 5) ; 


CC 


(b) complements of (a) ; (c) sequences consisting of at least 20 


CC 


contiguous residues of (a) ; (d) sequences that hybridize to (a) , under 


CC 


moderately stringent conditions; (e) sequences having at least 75% or 90% 


CC 


identity to 


(a) ; or (f ) degenerate variants of (a) . Polypeptides 


CC 


(ABP68596-ABP68637) encoded by (I) and oligonucleotide can be used to 



CC detect cancer in a patient and compositions comprising polypeptides, 

CC polynucleotides, antibodies, fusion proteins, T cell populations and 

CC antigen presenting cells expressing the polypeptide are useful in 

CC treating pancreatic cancer and stimulating an immune response. The 

CC polynucleotides can be used as probes or primers for nucleic acid 

CC hybridisation, in the design and preparation of ribozyme molecules for 

CC inhibiting expression of the tumour polypeptides and proteins in the 

CC tumour cells, in vaccines and for gene therapy. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo. int/pub/published_pct_sequences . 
XX 

SQ Sequence 1192 AA; 

Query Match 75.3%; Score 4403.5; DB 23; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 2.5e-221; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll Mill MINI MINIM llllll I II I h I h : I II 1 1 1 1 1 II I II 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

llllllll II III Mhll : I llllllllllllll lllllllh II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III MM IIIIIIIIIMIIIIIMII 1 1 II MilMMIMM 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II MiMMIII III MM III MlllhhIMIIIIIIIMIIIMI 

Db 179 RGSSGSVDETLFALPAASEPVI RSSAEN-MDLKEQPGNTI SAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill IN MIMI I : I I I : : I -llllh hi ::|ll lllllll 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KED 339 

Db 2 98 SEMGSSFSVSPKAESAVI VANPREEI IVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 3 95 

Mi ll I III —I Ihll I ll lll llh Mllh I I hIM = I - 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 3 96 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

Mllhll Mill : MM hi lllllll Ml I llllll I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I IIIIIIIIIIMIMIIIMM II MINIMI MM llllll h 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

INI IMIIIIIIIIIIIIIIIIIIII IMIIIHIM MM I MINN III 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 



Qy 574 CPSFEEAEATPSPVLPDI VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

MMIMMMMMMIMIIMM MMIIIMIII Mill MMIM II 1 1 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGI KEPESFNAAVQETEAPYI SIACDLI KETKLSTEPSPDF 692 

IIIIIMhhM ■ I II llllh llhlllllllllllllllllllll hill 

Db 656 PPPYEEAMSVSLKKVSGI KEE I KEPENI NAALQETEAP YI S I ACDLI KETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVS 752 

MMMM M M h I II I h II I II I II 1 1 h 1 1 hi MhMIIII I 

Db , 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 809 

I- ^ I t I I I I lllllllh H MM = = = Ihlllll IMM 

Db 776 FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 835 

Qy 810 NTAI YSNDDLLSSKEDKI KESETFSDSSP I E I I DEFPTFVSAKDDS - PKLAKEYTDLEVS 868 

HHIIIII III : I ^ I ^ I I I I : I I I I I I 1 I I i I I :hl I MMIIIIlli 
Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNIYPK- -DEVHVSDEFSENRSSVSKASISPSNV 926 

llllll II I I I I III III III II ^ Ihll l I II =1 ! 

Db 8 96 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 986 

Ml I h MM I I I I 1 1 : I t I I I ! I I : I lh I I i I i I I I I I I t ! I I I I I I 

Db 956 S ALATQAE I ES I VKP KVLVKEAE KKL PS DTEKEDRS P SA I FSAELS KTS WDLL YWRD I K 1015 

Qy 987 KTGWFGASLFLLLSLTVFS I VS VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRA 1046 

I h I M 1 1 1 M 1 1 1 1 II I MMI I II I II M 1 1 1 1 1 1 1 1 IMM I 1 1 M II 

Db 1016 KTGWFGASLFLLLSLTVFS I VS VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRA 1075 

Qy 1047 YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I M I 1 1 1 1 1 1 1 1 1 1 1 II I! M I M 1 1 1 1 1 1 1 1 1 1 1 M h 1 1 1 1 1 1 1 1 II 

Db 1076 YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

IIMIIIIIIIMIMMMIIII MMMMMMMMMMMIMMIMM 

Db 1136 NGLTLL I LAL I SLFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 8 
AAY56967 

ID AAY56967 standard; Protein; 1192 AA. 
XX 

AC AAY56967; 
XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide. 
XX 

KW MAGI protein; neuroendocrine- specific protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective ; neuroprotective. 

XX 



OS Homo sapiens. 
XX 

PN WO200005364-A1 . 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB02360 . 
XX 

PR 22-JUL-1998; 98GB- 0016024 . 

PR 19-JUL-1999; 99GB-0016898. 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC . 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR N-PSDB; AA256886. 
XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders - 
XX 

PS Claim 2; Page 20-21; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents the human MAGI protein. 

XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 4398.5; DB 21; Length 1192; 
Best Local Similarity 75.9%; Pred. No. 4.5e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll Mill llllll IMIIMI MIMI MMhlMMIMMMMIM 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

IMIIMI I! Ill MM : I II II I M M II I II IMIIIIh II = I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III MM I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! II II III: 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPS VLLETAASLPS 225 

II 1 1 1 1 1 1 1 M I II ; 1 1 1 II MM III MMIhhMMMIMMIMMM 

Db 17 9 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPS VLLETAASLPS 23 7 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 



-Ill MINI 

I DRDLTEFS ELEY 



Db 


238 


LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 


297 


Qy 


286 


SEMGSSFKGS PKGESAI LVENTKEEVI VRSKDKED - LVCSAALHSPQESPVG KED 

1 1 1 1 1 1 1 III 1 1 1 • • 1 1 • 1 1 • 1 1 • • 1 1 • 1 - II II- II 1 III 


339 


Db 


298 


1 M 1 1 1 1 III 1 1 1 • ■ 1 1 ■ 1 1 • 1 1 • • 1 1 • 1 • II • II- II I Ml 

SEMGSSFSVSPKAESAVIVANPREE I IVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 


357 


Qy 


340 


RWS P E KTMD I FN EMQM S WA PVR E E YAD F KP F EQAW E VKDT Y EG S RD VLAA RANV 

III II 1 III - ■ - 1 II - 1 1 1 1 I 1 1 1 1 1 1 - II 1! 1 ■ 1 1 1 ■ 1 II • 1 - 


395 


Db 


358 


Ml II 1 III • - * 1 1 I - 1 1 1 1 1 1 1 1 1 1 1 - Mill- 1 1 1*111 • 1 * 
E WSS EKAKDSFNEKRVAVEAPMREE YADFKPFERVWEVKDS KEDS - DMLAAGGKI ESNL 


416 


Qy 


396 


ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 

IMIIMI Mill : MM hi llllll! Ml MM 1 Mill 


454 


Db 


417 


ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 


476 


Qy 


455 


ANTFPLLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 


513 


Db 


477 


1 MM 1 1 1 1 ! M It 1 ! 1 1 1 M 1 1 1 M 1 1 II MM MM MM llllll M 

TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 


536 


Qy 


514 


KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 
1 II 1 1 . i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 II . 1 i 1 1 1 1 1 .llllll III 


Db 


537 


MM M M M M M II M 1 M II II M M M 1 M M 1 M M 1 M 1 M 1 M M Ml 
KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 


596 


Qy 


574 


CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 


633 


Db 


597 


IIIIIIMIIIIIIIMMIIMIII MIIIIIIMI! Mill M MM 

CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 


655 


Qy 


634 


PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 
1 1 1 1 1 1 M - 1 • 1 1 - 1 II 1 1 1 1 1 • 1 1 1 M 1 1 M 1 1 1 II M 1 II 1 1 II 1 1 1 M II 1 


692 


Db 


656 


1 M 1 1 1 1 1 • 1 • 1 1 • 1 II 1 1 1 1 1 • 1 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 
PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 


715 


Qy 


693 


SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 


752 


Db 


716 


HIM h MIMI MIMMMMM 1 1 IIIIIIMI hi llhllllll 1 

SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 


775 


Qy 


753 


- ETVAQHK- EERLSAS PQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKI SLQMEEF 


809 


Db 


776 


M ■■■■ MMII 1 1 III hh :| Mil ::: IMIIMI MM 

FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 


835 


Qy 


810 


NTAI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDS-PKLAKEYTDLEVS 

1 i 1 1 1 1 1 1 Ml MMMIIIIIIIIIMIIIII MM II IMIMIMM 

STAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVS 


868 


Db 


836 


895 


Qy 


869 


DKSEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNV 


926 


Db 


896 


llllll II Ml III III III II IIMIM Mil M M 

HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 


955 


Qy 


927 


SALEPQTEMGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDI K 


986 


Db 


956 


Ml 1 M MM 1 1 MIMMIMMIMM IM MIMMMIMMIMM 

SALATQAE I ES I VKPKVLVKEAEKKLPSDTEKEDRS PSA I FSAELS KTS WDLLYWRD I K 


1015 


Qy 


987 


KTGWFGASLFLLLSLTVFS I VS WAY I ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRA 


1046 


Db 


1016 


IMIMMMMMMMMMMMMMMMMMIMMMIMMMMIIMM 

KTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRA 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 



IMIMIMM i I: I I I I I I MIM MM 



Db 1076 YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 



Qy 1107 NGLTLLILALISLFSI PVI YERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 ! 1 1 1 IMIIIIIMhIIIIIIMIIIIIIIIIIh 

Db 1136 NGLTLL I LALI SLFS VP VI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 9 
AAB82349 

ID AAB82349 standard; Protein; 1192 AA. 
XX 

AC AAB82349; 
XX 

DT 23-JUL-2001 {first entry) 
XX 

DE Human NOGO-A protein. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; 

KW brain injury; stroke; neuronal degeneration; Alzheimer's disease ; 

KW Parkinson's disease; neuromuscular disorder; psychiatric disorder; 

KW developmental disorder; neuroprotective; nootropic; neuroleptic; 

KW antiparkinsonian; cerebroprotective; neuroleptic; diagnosis; 

KW therapy. 

XX 

OS Homo sapiens . 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2000WO-GB0434 5 . 
XX 

PR 15-NOV-1999; 9 9GB- 002 6 995 . 

PR 24-JAN-2000; 2000GB-0001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90324. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases 
XX 

PS Disclosure; Page 26-27; 25pp ; English. 
XX 

CC The present sequence is that of human NOGO-A. NOGO-A is a 

CC previously known splice variant of the human NOGO gene on chromosome 

CC 2p21. The invention relates to a novel splice variant, NOGO-C (see 

CC AAB82348) . It provides NOGO-C polypeptides and polynucleotides, and 

CC methods for producing such polypeptides by recombinant techniques. 

CC Also disclosed are methods for utilising NOGO-C polypeptides and 

CC polynucleotides in the treatment of diseases including neuropathies, 

CC spinal injury, brain injury, stroke, neuronal degeneration, for 

CC example Alzheimer's disease and Parkinson's disease, neuromuscular 



CC disorders, psychiatric disorders and developmental disorders. Also 

CC provided are methods for identifying agonists and agonists for 

CC use in treating conditions associated with NOGO-C imbalance, and 

CC diagnostic assays for detecting diseases associated with 

CC inappropriate NOGO-C activity or levels. 
XX 

SQ Sequence 1192 AA; 

Query Match 75.2%; Score 43 98.5; DB 22; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 4.5e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

INI Mill llllll MINIM I II I M 1 1 1 H ! ^ ■ , 1 1 1 1 1 M 1 1 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I t I I I I I I M Ml Mhll = I MMIMMIMM llllllll: II : I 

Db 5 9 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III I I I I Ml I IMI llllllll; I II II lllllllllllll 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II MMMMMMMIMM MM Ml IMIIhhMIIIIIIIIIMIIIII 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill llllll I : I I I : : I = = 11111= Ml = M II IMIMI 

Db 23 8 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 2 97 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KED 339 

M M I II III 1 1 h M I MhlhMhh II : Ih II I III 

Db 298 SEMGSSFSVSPKAESAVIVANPREEI IVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 34 0 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 3 95 

III II I III -I I I : I I I I I I I I I I h llllh I I hill 
Db 358 EWSSEKAKDSFNEKRVAVEA.PMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

I Mill : MM hi IMIMI Ml llllll I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I f 1 1 1 ! I M 1 1 1 1 1 = I i M = 1 1 1 II IMIIMM MM llllll 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM MMIMMMIMMMIMM I M 1 1 1 1 1 1 1 = 1 1 i 1 1 1 1 MIMM Ml 

Db 53 7 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy ' 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

I II 1 1 M 1 1 II II 1 1 II I II 1 1 M II MIIMIhlll Mill M M II I MM 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 



Qy 



634 PPPYEEAMNVALKAL-GTKEGI KEPESFNAAVQETEAPYI S IACDLI KETKLSTEPS PDF 692 



Illllllhhll : I II II! MM' MINIMI! MINI MM 

Db 656 PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDF 715 

Qy 693 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS 752 

I h I I ' : I I I 1,1 hi llhllllll I 

Db 716 SDYSEKAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 - ETVAQHK- EERLSAS PQELGKP YLES FQPNLHSTKDA - ASND I PTLTKKEKI SLQMEEF 8 09 

M- ::: - MM | II MMM : | MM ::: MUM Mill 

Db 776 FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 835 

Qy 810 NTAI YSNDDLLSSKEDKI KESETFSDSSPI EI I DEFPTFVSAKDDS - PKLAKEYTDLEVS 8 68 

M I • M I I II III U ^ i - I I I I I I I I I ! I i I I I MM M 111 = 11111111 
Db 836 STAVYSNDDLFI SKEAQIRETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNV 926 

I II MM III III III || ::: Ihlhl MM : I M 

Db 8 96 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGSI VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 986 

Ml I h I II I I I 1 1 1 1 1 1 1 1 i 1 1 1 [ i 1 1 1 Ih IIIIIIMIIIMIIIIII 

Db 956 SALATQAEIESI VKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRA 1046 

MIIIMM MIMIIIIIII MMIMMI IMMMIM IMMMIMI 

Db 1016 KTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRA 1075 

Qy 104 7 YLES EVA I SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

IMMMMMIIMMIMIMM 1 1 1 M I MM 1 1 1 1 M 1 1 M II 1 1 1 M 

Db 1076 YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSI PVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

MIIMI IMIMIMM 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 I M M I M M 

Db 113 6 NGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 10 
ABB81078 

ID ABB81078 standard; Protein; 1192 AA . 
XX 

AC ABB81078; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant ; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic ; 

KW osteopathic; vagotropic; nephrotropic ; cytostatic; antigen; gene therapy ; 

KW neurotransmitter receptor; human; receptor. 
XX 

OS Homo sapiens. 
XX 

PN US2002072493-A1 . 
XX 



PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-0893348 . 
XX 

PR 19-MAY-1998; 98 IL-0124500 . 

PR 21-JUL-1998; 98WO-US14715 . 

PR 22-DEC-1998; 98US-0218277 . 

PR 19-MAY-1999; 99US-0314 161 . 
XX 

PA ( YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central /peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs /peptides 
XX 

PS Examples; Page 53-56; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central /peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzf el dt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates) , Charcot -Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or 1 ipoproteinemia . The present sequence represents the human 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen. 

XX 

SQ Sequence 1192 AA; 



Query Match 



75.2%; Score 4398.5; DB 23; Length 1192; 



Best Local Similarity 75.9%; Pred. No. 4.5e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 6 0 

llhlll Mill MINI 1 1 1 1 1 ! 1 1 MM M MM I MM , M 1 1 1 1 1 M 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 
Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

IMIIIII II III MM M IIIIIMIMIIII MINIM: II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III MM MMMIMMMMIMM II II MIMIMIMM 
Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

M MIMIMMMMMIM MM III 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill MUM I : I I I : : I -Mllh hi -III MIIMI 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 2 97 

Qy 286 SEMGSSFKGS PKGESAI LVENTKEEVI VRSKDKED- L VCSAALHS PQES PVG -KED 33 9 

MIIMI Ml 1 1 1 : : i I MIM: : MMM II : M M I Ml 

Db 298 SEMGSSFSVSPKAESAVI VANPREEI IVKNKDEEEKXiVSNNILHNQQELPTALTKLVKED 357 
Qy 340 RWSPEKTMDIFNEMQMSWAPTOEEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

III II I III i MMMMMMM Mllh I I hill 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMIAAGGKIESNL 416 

Qy 3 96 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

llllhll 1 1 1 1 1 : MM hi MIIMI Ml MMM I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I IIIMIIIIIIMMIIhlll II IMIIIMI MM MMM h 

Db 4 77 TNI FPLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 53 6 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM MM . MMMMMMM IMIIIMIhlMMM MIIMI III 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

U M I i I Ml I I I I Ml HIM hhlll 1 1 1 1 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

MIMMhhll : I II Mllh llhlllMIIIIIIIIIIIIIIII MMM 

Db 656 PPPYEEAMSVSLKKVSGI KEEI KEPENINAALQETEAPYI SIACDLI KETKLSAEPAPDF 715 

Qy 693 SNYSE I AKFEKSVPEHAELVEDSS PESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVS 752 

MMM h . MMMMMMMMMMMMMM hi MhMMM I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 775 

Qy 753 - ETVAQHK- EERLSAS PQELGKP YLESFQPNLHSTKDA - ASND I PTLTKKEKI SLQMEEF 809 

! ::: MMM I I IMMIIh M MM MMM Mill 



Db 



776 FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 835 



Ov 


810 


NTAI YSNDDLLSSKEDKI KESETFSDSSPI E 1 1 DE F PTF VSAKDDS - PKLAKEYTDLEVS 


868 


Db 


836 


■I I, IN 1 1 hi ! :|:| || I 

STAVY^NDDTiF T SKEAOT R ETETF^ncJS PT F T T DFPPTTiT^^KTn^FCiKT ,ap p VTDT ,RV^ 

t_J X Ji. V 1 LjlMUJJJjr X O -Tv Hi x Hi x 17 ub'iJijri J_i X J. J_/ Hi F IT lUXuDIMUDr r\ 1 iM K r. X J. LJ 1—1 Hi V i_> 


ft 

O J7 


Ov 


869 


DKSEIAMIOSGAD^LPrLELPfDIj^FKNT YPK- -DRVHVCinppQFKfRCJ^V^KA^T ^PSKTV 

i— ' x >-hJ J— i -L tiii -l ^ ij ij xj x \— J— i i— i J— 1 1 \_ J—/ xj ij i i vlv _l x i iv xv Hi v 1 1 v O J_ J ill 1 Hi i 1 ^ x\. jj v j ivtAO 1 Dr u X*J V 

MINI II MM III III III II Ihlhl h II : 1 M 


_7 £i O 


Db 


8 96 


HKSEIANAPDGAGSLPCTELPHDLST.KNTOPKVFFKT SR^nnF^KTJn^ATCJKVT ,T .T.PPDV 

i nv-iw* J— i xnivini i^vjinvj i^j xj l. \_ x j ■ j 1 1 ihjuu xji \x\j x i iv v jjijivi o I ouur i_j i\x^ ijrt. X O i\. V XJXJXJ X x Xx V 


-7 -J 


Ov 


927 


SALEPOTEMGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVT .SAET i^KT^VVnT ,T ,YWRm K 

III 1 h MM 1 1 IIIIIIIIIMIIIIM Ih MMMIMIMMIMM 

SALATOAEI ESI VKPKVLVKEAEKKT.P^nTEKEDRSP^AT F^AFT.^KT^V\7T)T .T.YWRnT V 


Z> O D 


Db 


956 


X \J X .J 


Ov 


987 


KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 


1046 


Db 


1016 


MINIMI II MINIMI IMIIIIIIIIII III MINIMI IIIIIMIIMI III 

KTG WFGAS L F LLL S LT VF S I VS VTA Y I ALALLS VT I S F R I YKGVI QAI QKSDEGHPFRA 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 


Db 


1076 


MIIMMIMNIININMIM IIIIMIIMIIIIMIIIIIIIIIMIIIIIII 

YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 


Qy 


1107 


NGLTLLI LALI SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 


Db 


1136 


M 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 : 
NGLTLLI LALI SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 11 


ABG3 0 93 8 


ID 


ABG30938 standard; Protein; 1192 AA. 


XX 




AC 


ABG3 0 938; 


XX 




DT 


21-OCT-2002 (first entry) 


XX 




DE 


Human NogoA protein. 


XX 




KW 


Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury ; 


KW 


stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 


KW 


neuroblastoma; hyperprolif erat ive disorder; dysprol iterative disorder; 


KW 


cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 


KW 


tissue hypertrophy; central nervous system; axon regeneration; NogoA; 


KW 


Nogo-associated disease; metastasis. 


XX 




OS 


Homo sapiens. 


XX 




PN 


WO200257483-A2 . 


XX 




PD 


25-JUL-2002 . 


XX 




PF 


18-JAN-2002; 2 002WO-GB00228 . 


XX 




PR 


18-JAN-2001; 2001GB- 00013 12 . 


XX 




PA 


(GLAX ) GLAXO GROUP LTD . 


PA 


(SMIK ) SMITHKLINE BEECHAM PLC. 



PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 

DR N-PSDB; ABK90134. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprolif erative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides 

PT 

XX 

PS Disclosure; Page 59-62; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperprol if erative or dysprolif erative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present amino acid sequence represents the human NogoA 

CC protein of the invention. 

XX 

SQ Sequence 1192 AA; 



Query Match 75.2%; Score 43 98.5; DB 23; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 4.5e-221; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

MM Mill llllll MINIM MUM I II M = II = : M I II II M I II I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 
Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

II II 1 1 1 1 M III MM : I It 1 1 1 1 1 1 ) M 1 1 1 MINIM: | : | 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Ml 1 1 1 1 1 1 1 Mi I! I !i II I II 1 1; || I NIINNINN 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II NNIMNNININN INI III 1 1 1 1 1 h M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 



Ov 


226 


LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFWRDIAEFSELEY 

Mill Mill llllll 1 ii II |:| ::||| iiiiiii 

LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 


285 


Db 


238 


297 


Ov 


286 


SEMn^FKG^PKGE^ATLVRNTKFRVT VP^KnKFD TATT^A AT.W^PnFQPVr: KRF) 


33 9 


Db 


298 


lllllll III Ml- I MMhHhh II : Ih II 1 III 

SEMGSSFSVSPKAESAVI VANPREEI I VKNKDEFFKT .V^NN T T .HNDDFT ,PTAT,TK7 ,VKED 

lj -l— 'i iVJiJ < — ' X U ViJl Jl vtt-J — I ■ — »J7^. V _L v nil £ IV J_l J 1 X X. v INXN I VI— 'JJij ijiVLI V O IV IN X XJIXXV x^/'y/ j_j XJ C X^VXJ X 1 VXJ V 1 VX_J LJ 


357 


Ov 
vy 


340 


RVVSPEKTMDIFNEMOMSVVAPVREEYADFKPFEOAWEVKDTYRrJ^RDVT.AA RANV 

X\. V V t^J X. XJ IV 1 i 1 1— ' J_ i- -L * Ul 1*^/1 1 t^J v V .ii J_ v 1VU X_J J. xi-i—/ J_ 1 v, X X. Ill yAH X-J V I VI— ^ A X X_J v_J IV J_#/ V 1 LrA XXjTaXV V 

III II 1 III : : M Ihllllllllllh Mllh 1 1 hill :h 


3 95 


Db 


358 


EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMljAAGGKIESNL 


416 


Ov 


396 


ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 


454 


Db 


417 


llllhll Mill : MM hi lllllll Ml MUM 1 Mill 

ESKVDKKCFADSLEOTNHEKD^E^^NDDT^FP^TPEnTKDRPnAYTTrAPFMPAATRCITA 


476 


Ov 

vy 


455 


ANTFPLLEDHTSENKTDEKKI EERKAQI I TEK- TS PKTSNPFLVAVQDSEADYVTTDTLS 

1 MM 1 II II III III MM IIMI II IIIIIIMI MM MUM h 


513 


Db 


477 


TNIFPLLGDPTSENKTDEKKIEEKKAOI VTEKNTSTKT^NPFTiVAAOn^ETDYVTTDNTiT 


JJD 


Ov 


514 


KVTEAAVSKfMPEGLTPDT • VOEAPE^ET .NEATGTKT A YETKVDT .VOT^FA T OF^T .YPTADT . 

1 \- v X 1 JrLri v l_J XM 1 1 i_ i_ *V_J J — I J. i_ v ^i_inv^U i_} X_j XJXM J ilx X \J X J. \_ _L jTa. X J_J 1 Kv I v I I v \j X t_? LjxI X ^/ X-J LD * 1 X XT X Jri\^/X_l 


R7? 


Db 


537 


MM hllM IMIIIII III IIMI 1 IIIIIIIM hllll II Mlllll III 

KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 


596 


Ov 
vy 


574 


rP^FEEAEATP^PWjPnTWRAPTiM^TiT.P^AnA^WOP^V^PT.PaPPPV^Yn^TKT.PPPlvf 

\ — t OJ. oiIrli_jr\ X ttjx VUtUl V 11 Xjjtl It XJ1M O JJUrOiT-OriO V vyr O VO x Xj£Yrt.x x x V Lj lUOi. n 1 Tm L_i1>J 


D o j 


Db 


597 


1 . 1 I 1 1 1 ; 1 : 1 Mill II MM 

CPSFEESEATPSPVLPDI VMEAPLN^AVP^AGA^VTOP^^^PT.FA-^^VMYRCITKFIFPFKr 

' — i- t-* ■>- -i— ' i— it— I J— in. X 1. i — I i. v XJX i-J XVI l i4ii 1. JLJ1N wJ£*. V X UnVJ^ltJ V X l JiJur XJXjjTI. UlJ VIM X XJO X XVXXX-i C Lilly 


D -D 


Qv 


634 


PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 


692 


Db 


656 


IIMIIIhhM : 1 II Mllh MM 1 Mill II llllll IIMI Ihlll 

PPPYEEAMSVSLKKVSGIKEEIKEPEMTNAAT.OETEAPYTCIT ATnT.T KFTRT.9AFPAPDR 


71 R 


Ov 


693 


SNYSEIAKFEKSVPEHAELVEDSSPE^EPVDLF^DD^I PEVPOTOFFAVMT .MKF^T.TFVc; 

i^x>l X l-V LJ JL aI. l. i — 1 1 v J_ J — i X UTx i — J J — i w J — J XV u_J X X_i uV Xlf XT V X>XJX X XT X^l v X \^ X VV X-J X_UTx V 1 1XJ1 1 l\.LLl lJ 1 i X X_J V tZ? 


752 


Db 


716 


■ 1 h 1 1 : 1 1 1 1 1 1 1 ^ hi llhllllll 1 

^DY^FMAKVFOPVPnH^FT,VFnc;QPn ( ^FP\7nT,R c Jr)n c ^T pn\7PnT<rnnRT\7iviT .v^p^t trtq 


/ / D 


Ov 
vy 


753 


-ETVAOHK-EERIi^A^POFTtGKPYT.F^RnPMTiHCITKnA- A ^MHT PTT.TKK'RK'T ciT.nMRPR 


ft n q 

O U -7 


Db 


776 


I'-'- MMM 1 1 1 M MM 1 1 Mill 

FESMT EYENKEKTi^ATiPPFGGKPYT.F^RKT.^T.nMTK'nTT.T.PnR^^ClTT.QK'K'RK'T PT.OMRRT 

X Xji_ji ix Xj X X-ilV lVi_j X vXJi_>xt.XJ c x Lj\_rV_jr\.x X JJDDx IvxjO LjJJLM X L\±J X XJXJ x xJLj VOX JjO JTviVd IV J. r 1 llJlM "i "j J , 1 


p 7 r 
0 j j 


Ov 
vy 


810 


NTAI YSNDDLLS SKFDKT KF^FTF^n^C! PT F T T DRRPTFV^ A KDF)^ - PK7 .A VP YTHT .R\7Q 

J- ' X ^1. X X tJXM l-'X-' XJXJlJ lJ IVXjX>I\.X IVX_i J Dir JL/iJijri£il 1UDF ri r V tDJr\ IXLJUkD X r\ 1 IH r\ P. X L XJXJIli V O 


ft 6fi 

O D O 


Db 


836 


1 1 1 1 1 1 1 1 III MMMMMMMMMIMI MM II llhllllllll 

STAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVS 


OQC 

0 y 0 


Ov 

vy 


869 


xJivo Lj x JAiM x ^00/A.J-JiDxJir ^xjCjxj it V_JJljor Ivxsi X I r IV JJd vrl V oxJxjr oCilMKoo Vbiuiib 1 o ir oiN V 


Q 0 C 


Db 


896 


MMM II MM III III III II ::: Ihlhl h II \ ■ 1 

HKSEIANAPDGAn^TiPPTFT .PHDT.^TiKMT DPTCVRFKT QRQDDRQK'KrnQATQ^rr TT PPDV 

i iiv.ij Xj x ru^rir L/uAUOXir ^ii-jXir riX^XJiZ) XjiVLM l^r IVV £L xjIV J. o " o xJl_/ IT O IVLN \JOii. J. O IV V xjxjxjx rJJV 


j D Z> 


Ov 
vy 


927 


^ATiFPOTEMG^ T VK'^K^T.TKRAFK'K'T.P^nTFPCRnR QT,c;a^7T,QART 1 QK'TQ\A/T1T ,T .YWPnT V 

iJ/lXI Xj X Vi X Xj1*1\_I i_> X v IViZJ IViD XJ X IVI-Ln.Xj iVlVXJ x OxJ X Xj rVIjlJl\.DlJijH V jUOiil-jXJi_) IV lij V V | / 1 > 1 I x ii x\.LJ X IV 

Ml 1 h MM 1 1 h 1 1 1 1 i ill Ih 1 1 M 1 1 III 

SAlxATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIK 


Q ft 6 


Db 


956 


1015 


Qy 


987 


KTGWFGASLFLLLSLTVFS I VS VTAYI ALxALLS VTI SFRI YKGVI QAI QKSDEGHPFRA 


1046 


Db 


1016 


IIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIII 

KTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVIQAI QKSDEGHPFRA 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 



II IIIIMIM MINIMI 1 1 1 ! 1 1 ! 1 1 M I M M i 1 1 1 1 1 1 II 1 1 11 1 1 1 

Db 1076 YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLLILALISLFSIPVI YERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

MIIIIMIMIII|:|IIIIIII llllllilllhllllllllllillllllll: 

Db 1136 NGLTLL I IAL I SLFS VP VI YERHQAQ I DHYLG IAN KNVKDAMAKIQAKI PGLKRKAE 1192 

RESULT 12 
AAY71311 

ID AAY71311 standard; Protein; 1178 AA. 
XX 

AC AAY71311; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Human neurite growth inhibitor Nogo. 
XX 

KW Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment ; 

KW structural plasticity; screening. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Misc-dif f erence 187 

FT /label= Unknown 

FT Misc-dif ference 188 

FT /label= Unknown 

FT Misc-dif ference 189 

FT /label- Unknown 

FT Misc-dif ference 190 

FT /label= Unknown 

FT Misc-dif ference 221 

FT /label= Unknown 

FT Misc-dif ference 328 

FT /label- Unknown 

FT Misc-dif ference 477 

FT /label= Unknown 

FT Region 994. .1174 

FT /note= "Region specifically described in claim 16" 

FT Region 977 . . 1012 

FT /note= "Region specifically described in claim 16" 

FT Region 1079. . 1114 

FT /note= "Region specifically described in claim 16" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 010744 6 . 



XX 

PA (SCHW/) SCHWAB ME. 

PA (CHEN/) CHEN MS. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a 

CC potent neural cell growth inhibitor and is free of all central nervous 

CC system (CNS) myelin material with which it is natively associated. The 

CC human Nogo sequence was derived by aligning human expressed sequence tags 

CC (ESTs) e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, 

CC AA081525 and AA081840 with the rat Nogo sequence. 

CC Nogo proteins and fragments displaying neurite growth inhibitory 

CC activity are used in the treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, ependyoma, 

CC pinealoma, haemangiobla stoma , acoustic neuroma, oligodendroglioma, 

CC menagioma, neuroblastoma or retinoblastoma and degenerative nerve 

CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which 

CC promote Nogo activity can be used to treat or prevent hyperprol iterative 

CC or benign dysprol iterative disorders e.g. psoriasis and tissue 

CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to 

CC inhibit production of Nogo protein to induce regeneration of neurons or 

CC to promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 1178 AA; 



Query Match 73.1%; Score 4276.5; DB 21; Length 1178; 

Best Local Similarity 73.8%; Pred. No. le-214; 

Matches 883; Conservative 104; Mismatches 158; Indels 51; Gaps 19; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 6 0 

MUM Mill II IN MINIM MMM II II h I h M 1 1 M I II 1 1 1 II 

Db 1 MEDLDQSPLVSSS-DSVPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 
Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Ml III II Ml IMMI M MINIMUM MIMM II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Ml MM 1 1 1 1 1 1 1 i 1 1 1 1 1 f i 1 1 1 1 1 M II Mill IMMI 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 



Qy 167 RGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Ml = Mill lllllhhlllllllllllllll Ml 

Db 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

MM IMM llllll I = I I I = = I ::|lllh hi = = I M 

Db 225 S PLS AAS FKEHE YLGNLST VL PTEGTLQENVS EAS KE VS E KAKTLL I DRDLTE FS ELE YS 284 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVI VRSKDKED-LVCSAALHSPQESPVG KEDR 340 

llllll III Ml- I :|hlh:|hh II = II II I III 

Db 285 EMGSSFSVSPKAESAVIVANPREEI I VKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

Qy 341 WSPEKTMDIFNEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Ml II I III -M IhllMllllllh Mllh I I hill MM 

Db 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

Qy 3 97 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 455 

1111 = 11 Mill : MM hi lllllll Ml I llllll I Mill 

Db 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Qy 4 56 NTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

I llllll 1 1 1 1 IMMIIIMMIMM II llllll II Ml llllll hi 

Db 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSCTDYVTTDNLTK 523 

Qy 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Ml hMMMMMMIIMIMM IIIIMIIIhlllllll MIIMI MM 

Db 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEA/MQESLYPAAQLC 583 

Qy 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

I ; I i i II :ll MM Mill N Mill 

Db 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

Qy 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

M M II h h M I II Mllh MhMMMMMMMMIMM I I : I I I I : 
Db 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

Qy 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVS-E 753 

MIMI h IIMMIIIIIMMMIIIIMIIIhlll hi MIMIIMI I I 

Db 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

Qy 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKISLQMEEFNT 811 

- MMM I I lllllllh M MM = = : IIMIIII Mill M 

Db 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLVNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

Qy 812 AI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

IMIIMI III MMMIMIMMMIIMM MM II IMMMIMM I 

Db 823 AVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

Qy 871 SEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNVSA 928 

IMM II MM Ml Ml Ml II Ihlhl h II I MM 

Db 8 83 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

Qy 92 9 L - EPQTEMGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDI KK 987 

I I I MM I I II II 1 1 1 II II I M II I Ih MM MMMMMIMM 

Db 943 LGHTQAE I ES I VKPKVLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTS WDLLYWRDI KK 1002 

Qy 988 TG WFGASLFLLLSLTVFS I VS VTAY I ALALLS VT I S FR I YKGVT QA I QKSDEGHPFRAY 1047 



I I I Ml I I I M I I U ; I I I M I.I I Mill h 

Db 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALA.LLSVTISFRI YKGVIQAIQKSDEGHPFRAY 1062 

Qy 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

MMIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIMIIIIIMIIIIIIIIIIIIMI 

Db 1063 LESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 



Qy 1108 GLTLLI LALI SLFSI PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

IMIIMIIIIIIhllllllll IMIMIIIIhlllMIIIIMIIIIIMI: 

Db 1123 GLTLLI LAL I SLFS VPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 



RESULT 13 
AAU33228 

ID AAU33228 standard; Protein; 1246 AA. 
XX 

AC AAU33228; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Novel human secreted protein #3719. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement; 

KW stem cell proliferation; haematopoiesis ; nerve tissue regeneration; 

KW immune suppression; immune stimulation; ant i- inflammatory; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200179449-A2 . 
XX 

PD 25-OCT-2001. 
XX 

PF 16-APR-2001; 2 00 1WO-US08 656 . 
XX 

PR 18-APR-2000; 2000US-0552929 . 

PR 26-JAN-2001; 2001US-0770160 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-611725/70. 
XX 

PT Nucleic acids encoding a range of human polypeptides, useful in genetic 

PT vaccination, testing and therapy - 

XX 

PS Claim 20; Page 737; 765pp ; English. 
XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 

CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising 

CC the nucleic acids encoding the polypeptides and cells genetically 



CC engineered to express them are also useful for producing the proteins. 

CC The proteins are useful in genetic vaccination, testing and 

CC therapy, and can be used as nutritional supplements. They may be used to 

CC increase stem cell proliferation; to regulate haematopoiesis ; and in 

CC bone, cartilage, tendon and/or nerve tissue growth or regeneration; 

CC immune suppression and/or stimulation; as ant i- inflammatory agents; and 

CC in treatment of leukaemias. AAU2 9510-AAU333 04 represent the amino acid 

CC sequences of novel human secreted proteins of the invention. 
XX 

SQ Sequence 1246 AA; 

Query Match 70.4%; Score 4116; DB 22; Length 1246; 

Best Local Similarity 72.0%; Pred. No. 2.6e-206; 

Matches 873; Conservative 112; Mismatches 171; Indels 56; Gaps 27; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll Mill llllll llllllll MINI l, ! 1 , 

Db 42 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 99 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

llllllll II III llhll H IIIIIIIIIIIIM MINIM: II : I 
Db 100 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 159 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III 1 1 1 1 I ! I I I I I || || Illllllllllll 

Db 160 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 219 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II MIIIMIIIMIIIIIII 1 1 1 1 III 

Db 220 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 2 78 

Qy 226 LS PLSTVSFKEHGYLGNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDLAEFSELEY 2 85 

Mill Mill llllll I II \ \ \ hi I lllllll 

Db 279 LS PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLI DRDLTEFSELEY 338 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KED 339 

lllllll III MhM I : I h II - I i : I : II : Ih M I Ml 
Db 33 9 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 3 98 

Qy 340 RWS PEKTMDI FNEMQMS WAPVREEYADFKPFEQAWE VKDTYEGSRDVLAA RANV 3 95 

Ml II I III ! MMMIMIMM: INN: I I hill M: 

Db 399 EVVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS - DMLAAGGKI ESNL 457 

Qy 3 96 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 4 54 

MMIMI Mill : MM hi lllllll Ml I llllll I Mill 

Db 458 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 517 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I llllllllllllhlllhlll M MMMMI MM llllll I 

Db 518 TNI FPLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 577 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 : 1 M I M MIMM III 

Db 578 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 637 

Qy 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMMhIM Mill MhM! MM 



Db 638 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSWYESIKHEPEN 696 



Qy 


634 


PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 


692 


Db 


697 


IMMMIMM ^ 1 II llllh 1 H 1 1 Ml M II I M 1 1 1 II Ml 

P P P YEEAMS VSLKKVSG IKEEI KEPEN I NAALQETEAP Y I S I ACDL I KETKLSAE PA PDF 


756 


Qy 


693 


SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 
1 . 1 1 ! - 1 1 1 . 1 I . 1 . 1 1 1 1 I I I I . 1 I I I .mi I I mi. 


Db 


757 


i*tii*ii I * I i ■ 1 • 1 1 1 1 1 1 1 1 • 1 1 1 i 1 1 1 1 1 1 1 1 1 - M 1 1*1 1 M - M II II 1 
SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 


816 


Qy 


753 


-ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEF 
1 . I Mi I i i i i i i i j i i lit ii i i i i i i i i t i 


809 


Db 


817 


1 : : : : : * 1 : 1 M 1 1 11111111= = 1 : 1 1 1 •'■>•* 1 1 = 1 1 1 1 1 1 1 1 II 
FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 


876 


Qy 


810 


NTAIYSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDS-PKLAKEYTDLEVS 
.11.111111 III I I I I i I i i i i i i i i i i i i i i i ii iii i i i i i i i i 


868 


Db 


877 


" 1 1 : 1 1 1 1 1 1 III : 1 • h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml || | | | : | | | | | | | | 
STAVYSNDDLFI SKEAQI RETETFSDSSPI EI I DEFPTLI SSKTDSFSKLAREYTDLEVS 


936 


Qy 


869 


DKSEIANIQSGADSLPCLELPCDLSFKNIYPK- -DEVHVSDEFSENRSSVSKASISPSNV 
1 1 1 1 1 1 || i i i i in mi in i i i i i i i i ii i i 


926 


Db 


937 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II : : : 1 1 ' 1 M 1 M 1 1 MM 
HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 


996 


Qy 


927 


SALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIK 
III I l I I I I I i i i i i i i i i i i i i i i i i i ii i i i i i i i i i i i i i i i i i i i 


986 


Db 


997 


III 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 


1056 


Qy 


987 


KTGVVFGAS-LFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFR 


1045 


Db 


1057 


MINIMI MIMIMMMMMMMMMMMMMIMMMMMMIMM 

KTGWFGASAVFLLLSLTVFS I VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFR 


1116 


Qy 


1046 


AY- - -LESEVAI SEELVQKYSNSALGHV-NSTI KELRR- - - LFLVDDLVDSLK- FAVLMW 


1097 


UD 


1117 


1 MM : 1 MIMM MM 1 1 MMIh MIMI 

AISGNLESCLYLRELGSGRYSNSALGSMWNCTVKGNFRAPSFFSWMDLVDSLRSFAVLMW 


1176 


Qy 


1098 


VFT YVGALFNGLTLL 1 LAL I SLFS I PVI YERHQVQ I DH YLGLANKS VKDAMAKI 


1151 


Db 


1177 


MUM 1 IMI 1 II MM MM III MINI III III 

VFTYVGCL- -GLMVLDTTGFWALNFISSSGSWLI YERHQAQIDHYLGLANKNVKDAMAKI 


1234 


Qy 


1152 


QAKI PGLKRKAD 1163 




Db 


1235 


IMIIMMIh 

QAKI PGLKRKAE 124 6 





RESULT 14 
AAY71562 

ID AAY71562 standard; Protein; 803 AA. 
XX 

AC AAY71562; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiG. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 



KW hyperprol iterative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment ; 

KW structural plasticity; screening; mutant; mutein. 
XX 

OS Rattus sp. 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US2 6160 . 
XX 

PR 06 -NOV- 19 98 ; 98US- 0107446 . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Example; Page 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of 

CC all central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying neurite 

CC growth inhibitory activity are used in the treatment of neoplastic 

CC disease of the CNS e.g. glioma, glioblastoma, medulloblastoma, 

CC craniopharyngioma, ependyoma, pinealoma, haemangioblastoma , acoustic 

CC neuroma, oligodendroglioma, menagioma, neuroblastoma or retinoblastoma 

CC and degenerative nerve diseases e.g. Alzheimer's and Parkinson's 

CC diseases. Therapeutics which promote Nogo activity can be used to treat 

CC or prevent hyperprol iterative or benign dysprol iterative disorders e.g. 

CC psoriasis and tissue hypertrophy. Ribozymes or antisense Nogo nucleic 

CC acids can be used to inhibit production of Nogo protein to induce 

CC regeneration of neurons or to promote structural plasticity of the CNS 

CC in disorders where neurite growth, regeneration or maintenance are 

CC deficient or desired. The animal models can be used in diagnostic and 

CC screening methods for predisposition to disorders and to screen for or 

CC test molecules which can treat or prevent disorders or diseases of the 

CC CNS. The present sequence is a fragment of rat Nogo A protein shown in 

CC AAY71310, which is used in the construction of mutant NiG. The mutant 

CC is composed of His-tag/T7-tag/Nogo-A sequence aa 172-974/His-tag. 

CC Nogo A deletion mutants were used for mapping the inhibitory sites of 

CC Nogo protein. Major inhibitory region was identified in the 

CC Nogo A sequence from amino acids 172-974, particularly amino acids 

CC 542-722. In addition, N-terminal region 1-171 was found to be inhibitory 

CC to NIH 3T3 fibroblast spreading. 

CC Note: The present sequence is not given in the specification but is 

CC derived from rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 

CC are referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers. 



XX 

SQ Sequence 8 03 AA; 



Query Match 68.8%; Score 4023; DB 21; Length 803; 

Best Local Similarity 99.9%; Pred. No. l.le-201; 

Matches 802; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 172 VDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 231 

Ml! MIMMMIMM MIMMMIMM MMMIMIMM IMMII 

Db 1 VDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 60 

Qy 232 VSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSS 291 

MM MINIUM III III IIIIIMIIIIIII llllll III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 VSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSS 12 0 

Qy 292 FKGS PKGESAI LVENTKEE VI VRSKDKEDLVCSAALHS PQES PVGKEDR WS PEKTMD I F 351 

II IMMMMIMM MliMIMIMM MIIIIIMIMIII IIIIIMI 

Db 121 FKGS PKGESA I LVENTKEE VI VRSKDKEDLVCSAALHS PQES PVGKEDRWS PEKTMD IF 180 

Qy 352 NEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQK 411 

M 

Db 181 NEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQK 24 0 

Qy 412 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTD 471 

MIMIIIIMIMM MMM llllll llllll Mill III III Mill III I 

Db 241 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTD 300 

Qy 472 EKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDL 531 

MMMMMMMMMMMMMMMMMMMMIMIMMMMIMIMM 

Db 301 EKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDL 360 

Qy 532 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 591 

M ii mmmmmmmmimimm iiiiM , ;i i I i mi i 

Db 361 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 420 

Qy 592 VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 651 

1 1 . : 1 1 i i : 1 1 1 1 1 1 i i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 i i : I I 1 1 

Db 421 VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 480 

Qy 652 EGI KEPESFNAAVQETEAPYI S IACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 711 

I 

Db 481 EGI KEPESFNAAVQETEAPYI S IACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 540 

Qy 712 VEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELG 771 

IMIIIIIIIIIIIIIIMMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 541 VEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELG 600 

Qy 772 KPYLESFQPNLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESE 831 

MMMIIIMIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIMMIIIIIIMIMI 

Db 601 KPYLESFQPNLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESE 660 

Qy 832 TFSDSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 891 

I I I I I I ' ' I I I M I. I I I'l : I I i.M hi I 

Db 661 TFSDSSPIEI I DEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 720 

Qy 892 LSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSI VKSKSLTKEAEKK 951 

1 1 1 1 Ml I M I Ml MM M 1 1 MM I M M 1 1 1 MIMM 1 1 1 IMM I Ml Ml 



Db 



721 LSFKNI YPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 780 



Qy 952 IjPSDTEKEDRSLSAVLSAELSKT 974 

! M I Mil 1 1 1 1 -1 1 1 1 I MM 

Db 781 LPSDTEKEDRSLSAVLSAELSKT 803 



RESULT 15 
AAY71386 

ID AAY71386 standard; Protein; 737 AA. 
XX 

AC AAY71386; 
XX 

DT 02-NOV-2 000 {first entry) 
XX 

DE Rat Nogo A protein fragment used in the construction of mutant NiG-Dl . 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma ; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US26160 . 
XX 

PR 06-NOV-1998; 98US- 0107446 . 
XX 

PA (SCHW/) SCHWAB ME. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Example; Page -; 122pp ; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of 

CC all central nervous system (CNS) myelin material with which it is 

CC natively associated. Nogo proteins and fragments displaying neurite 

CC growth inhibitory activity are used in the treatment of neoplastic 

CC disease of the CNS e.g. glioma, glioblastoma, medulloblastoma , 

CC craniopharyngioma, ependyoma, pinealoma, haemangioblastoma , acoustic 

CC neuroma, oligodendroglioma, menagioma, neuroblastoma or retinoblastoma 

CC and degenerative nerve diseases e.g. Alzheimer's and Parkinson's 

CC diseases. Therapeutics which promote Nogo activity can be used to treat 



CC or prevent hyperprol iterative or benign dysprol iterative disorders e.g. 

CC psoriasis and tissue hypertrophy. Ribozymes or antisense Nogo nucleic 

CC acids can be used to inhibit production of Nogo protein to induce 

CC regeneration of neurons or to promote structural plasticity of the CNS 

CC in disorders where neurite growth, regeneration or maintenance are 

CC deficient or desired. The animal models can be used in diagnostic and 

CC screening methods for predisposition to disorders and to screen for or 

CC test molecules which can treat or prevent disorders or diseases of the 

CC CNS . The present sequence is a fragment of rat Nogo A protein shown in 

CC AAY71310, which is used in the construction of mutant NiG-Dl. NiG-Dl 

CC is composed of His-tag/T7-tag/Nogo-A sequence aa 172 -908/vector . 

CC Nogo A deletion mutants were used for mapping the inhibitory sites of 

CC Nogo protein. Major inhibitory region was identified in the 

CC Nogo A sequence from amino acids 172-974, particularly amino acids 

CC 542-722. In addition, N-terminal region 1-171 was found to be inhibitory 

CC to NIH 3T3 fibroblast spreading. 

CC Note: The present sequence is not given in the specification but is 

CC derived from rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 

CC are referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers. 

XX 

SQ Sequence 737 AA; 

Query Match 63.5%; Score 3714; DB 21; Length 737; 

Best Local Similarity 99.9%; Pred. No. 1.2e-185; 

Matches 736; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 172 VDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 231 

I Ml I I I I : I I I I I I MM i I I l I I I 

Db 1 VDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLST 60 

Qy 232 VSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSS 2 91 

I ll , 1 1 M I M II I MM ll II II M II II II M II III II II 1 1 IM II I 

Db 61 VSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSS 120 

Qy 292 FKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIF 351 

I 1 f ! I I I I : I M I M I I I I I I MM 

Db 121 FKGSPKGESAILVENTKEEVI VRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIF 180 

Qy 352 NEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDvliAARANVESKVDRKCLEDSLEQK 411 

M I : I M M I I i I I , Ml I I I ( i M; | |,| 

Db 181 NEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCIEDSLEQK 24 0 

Qy 412 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTD 471 

IMIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIMMIIMIIIIMIIIMIII 

Db 241 SLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTD 300 

Qy 472 EKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDL 531 

I I I MMI I I I I I MMI I I Ml Ml i Ml Ml I 

Db 301 EKKIEERKAQI I TEKTSPKTSNPFLVA VQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDL 360 

Qy 532 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 591 

I I 1 I Ml I I I I MMI I , I I I I I I I MM 

Db 361 VQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI 420 

Qy 592 VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 651 



I I : I I I I I I I : I I I < I I.!: I I I ! I ' I I 

VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 4 80 



Db 


421 


VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTK 480 


Qy 


652 


EGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 711 


Db 


481 


IIIIIIIMMMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMI 

EGI KEPESFNAAVQETEAPYI S I ACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPEHAEL 54 0 


Qy 


712 


VEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELG 771 

IMM IMIIIII II IMIIIMM Mlllllllll IMMIIMM MM 


Db 


541 


VEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELG 6 00 


Qy 


772 


KPYLESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESE 831 


Db 


601 


IIMII Mlllllllll IMMMIM MMMMIMMMIMMM MM 

KPYLESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESE 660 


Qy 


832 


TFSDSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 8 91 


Db 


661 


IIIIMMIIIIIIIIIIIMIIIIIIIIIIMIIIIMIMIIIIIMIIIIIMIIM 

TFSDSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCD 720 


Qy 


892 


LSFKNI YPKDEVHVSDE 908 


Db 


721 


lllllll lllllll 

LSFKNI YPKDEVHVSDE 737 



Search completed: December 19, 2003, 15:35:11 
Job time 52 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



December 19, 2 003, 15:34:13 ; Search time 21 Seconds 

(without alignments) 
2343.216 Million cell updates/sec 

US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP VKDAMAKI QAKI PGLKRKAD 1163 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



328717 seqs, 42310858 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



328717 



Database : Issued_Patents__AA: * 

1 : /cgn2_6/ptodata/l/iaa/5A_COMB .pep : * 

2 : / cgn2_6/ptodata/ 1/ iaa/5B_C0MB . pep : * 

3 : /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result - Query 

No. Score Match Length DB ID 



Description 



1 


908 


15 


5 


199 


2 


US- 


08 


-700 


-607-1 


Sequence 


1, Appli 


2 


789.5 


13 


5 


776 


2 


us- 


08 


-700 


-607-5 


Sequence 


5, Appli 


3 


716.5 


12 


3 


356 


2 


us- 


08 


-700 


-607-6 


Sequence 


6, Appli 


4 


685 


11 


7 


208 


2 


us- 


08 


-700 


-607-7 


Sequence 


7, Appli 


5 


671 


11 


5 


267 


2 


us- 


08 


-700 


-607-8 


Sequence 


8, Appli 


6 


539.5 


9 


2 


168 


4 


us- 


09 


-149 


-476-563 


Sequence 


563, App 


7 


519 


8 


9 


241 


2 


us- 


08 


-700 


-607-3 


Sequence 


3, Appli 


8 


302.5 


5 


2 


8991 


4 


us- 


08 


-714 


-741-32 


Sequence 


32, Appl 


9 


286 


4 


9 


92 


4 


us- 


09 


-149 


-476-411 


Sequence 


411, App 


10 


279.5 


4 


8 


1786 


3 


us- 


08 


-973 


-462-8 


Sequence 


8, Appli 


11 


265 . 5 


4 . 


5 


1780 


1 


us- 


08 


-769 


-309A-5 


Sequence 


5, Appli 



12 


265 . 5 


4 


. 5 


1780 


3 


US-08-994 


13 


265 


4 


. 5 


2137 


4 


US-09-134 


14 


2 54.5 


4 


. 4 


1596 


4 


US-08-978 


15 


233 


4 


. 0 


1142 


2 


US-08-993 


16 


233 


4 


. 0 


1142 


3 


US-08-845 


17 


233 


4 


. 0 


1142 


4 


US-09-066 


18 


232.5 


4 


. 0 


1805 


1 


US-07-853 


19 


228 


3 


. 9 


1142 


3 


US-09-061 


20 


228 


3 


. 9 


1142 


4 


US-09-899 


2 1 


227 


3 


9 


1270 


4 


US-07-757 


22 


227 


3 


9 


1311 


4 


US-07-757 


23 


227 


3 


9 


1313 


4 


US-07-757 


24 


227 


3 


9 


1314 


4 


US-07-757 


25 


22 7 


3 


9 


1320 


4 


US-07-757 


26 


227 


3 


9 


1320 


4 


US-07-757 


2 7 


227 


3 


9 


1354 


4 


US-07-757 


28 


227 


3 


9 


1361 


4 


US-07-757 


2 9 


227 


3 


9 


1363 


4 


US-07-757 


30 


227 


3 


9 


1404 


4 


US-07-757 


3 1 


227 


3 


9 


1404 


4 


US-07-757 


32 


225 


3 


8 


688 


3 


US-09-141 


33 


224 . 5 


3 . 


8 


941 


4 


US-07-757 


34 


224.5 


3 . 


8 


1022 


4 


US-07-757 


35 


224 . 5 


3 . 


8 


1038 


4 


US-07-757 


36 


224 . 5 


3 . 


8 


1049 


4 


US-07-757 


37 


224 . 5 


3 . 


8 


1140 


4 


US-07-757 


38 


224 . 5 


3 . 


8 


1346 


2 


US-08-635 


39 


224 . 5 


3 . 


8 


1346 


4 


US-08-978 


40 


224 


3 . 


8 


630 


3 


US-08-973 


41 


223 


3 . 


8 


2409 


6 


5180808-2 


42 


222 


3. 


8 


1312 


3 


US-09-041 


43 


222 


3 . 


8 


1312 


4 


US-09-648 


44 


220 


3. 


8 


1601 


4 


US-09-345 


45 


218 . 5 


3. 


7 


2842 


1 


US-07-741 



-570-5 


Sequence 


5 , Appl i 


-001C-4463 


Sequence 


4463, Ap 


-277A-4 


Sequence 


4, Appli 


-118-7 


Sequence 


7 , Appl i 


■528C-7 


Sequence 


7 , Appli 


-28 1B-7 


Sequence 


1, Appli 


-913-2 


Sequence 


2, Appli 


-709-2 


Sequence 


2, Appli 


■651-2 


Sequence 


2, Appli 


-022B-44 


Sequence 


44, Appl 


-022B-42 


Sequence 


42, Appl 


-022B- 142 


Sequence 


142, App 


-022B-50 


Sequence 


50, Appl 


•022B-46 


Sequence 


46, Appl 


022B-60 


Sequence 


60, Appl 


022B-48 


Sequence 


48, Appl 


022B-40 


Sequence 


40, Appl 


022B-52 


Sequence 


52, Appl 


022B-2 


Sequence 


2, Appli 


022B-62 


Sequence 


62, Appl 


047-8 


Sequence 


8, Appli 


022B-14 


Sequence 


14, Appl 


022B-84 


Sequence 


84, Appl 


022B-74 


Sequence 


74, Appl 


022B-58 


Sequence 


58, Appl 


022B-104 


Sequence 


104, App 


121-2 


Sequence 


2, Appli 


277A-2 


Sequence 


2, Appli 


462-9 


Sequence 


9, Appli 




Patent No. 


5180808 


886-19 


Sequence 


19, Appl 


281-2 


Sequence 


2, Appli 


473E-40 


Sequence 


40, Appl 


940-7 


Sequence 


1, Appli 



ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

Sequence 1, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : U.S. 
ZIP : 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J . 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-8 55-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERI STI CS : 
LENGTH: 199 amino acids 
TYPE: amino acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-1 

Query Match 15.5%; Score 908; DB 2; Length 199; 

Best Local Similarity 96.3%; Pred. No. 1.2e-50; 

Matches 184; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 

KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI 1032 

i 1 1 f 1 1 f 1 1 J 1 1 1 1 f 1 1 1 f 1 1 1 1 f 1 1 1 1 1 1 f j i f f i f i j i j j r i r f 1 1 1 r 1 1 j i j r r i 

KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI 68 
QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 1092 

IIIIIMIIMIIIIIIIIMMIIIIIIIIIIMIMI IIIIIIIMIIIIIIIIIII 

QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKF 128 
AVLMWVFTYVGALFNGLTLLI LAL I SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI Q 1152 

MMIIIIIIIIilllllllllllllllhllllllM IIMIIIIIIhlMIIIIM 

AVLMWVFTYVGALFNGLTLLI LALI SLFS VPVI YERHQAQIDHYLGLANKNVKDAMAKIQ 18 8 

AKIPGLKRKAD 1163 
lllllllllh 



Qy 


973 


Db 


9 


Qy 


1033 


Db 


69 


Qy 


1093 


Db 


129 


Qy 


1153 


Db 


189 



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 



TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY : U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 776 amino acids 

TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 307307 
US-08-700-607-5 



Query Match 13.5%; Score 78 9.5; DB 2; Length 776; 

Best Local Similarity 31.2%; Pred. No. 3.2e-42; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 

QY 487 TS PKTSN P FLVAVQDS EAD YVTTDTLS KVTEAAVSNMP EGLTPDLVQEACES ELNEATGT 54 6 

: I : I II- : = I : I II :| : I 

65 SGPARQSP- -VAMETASTGVAGVSSAMDHTFSTTSKDGEG SCYTSLI S 110 

QY 54 7 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI - -VMEAPLNS 599 

11=1 : :| h : | I I I I I ||h : | | 

Db m DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGI ESRGLFSSDSG 167 

QY 600 - -LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

: I : I = 11 = = 1 : : | : : : | |: 

Db 168 IEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGI KEPE SFNAAVQETEAPYI S I ACDLI KETKLSTE - PS P 690 

I I = = M = : = I | M I II =| = : = =| 

Db 228 DISIK PEGVREPDKPAPVEGKI IKDHLLEESTFAPYID DLSEEQRRAPQITTP 280 



Qy 



691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSI PEV PQTQEEAVMLMK 745 



Db 



281 VKITLTEIE PS VETTTQEKTPEKQDI CLKPS PDTVPTVTVSEPEDDS PGS I TPP 334 



Qy 746 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792 

I II I =| I :| ::| : | | : | | 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGPPT 394 

Qy 793 IPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI 827 

||: | | : : : | | : : || : 

Db 3 95 IPSPLDHEASSAESGDSEIELVSEDPM- -AAEDALPSGYVSFGHVGGPPPSPASPSIQYS 4 52 

Qy 828 KESETFSDSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 879 

:| I II hi I II -III - == I I 

Db 453 ILREEREAELDS ELI I ESCDASSASEESPKREQDSPPMKPSALDAIREETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKAS I S PSNVSALEPQTEMGS I V 939 

Db 510 APSRRGLAEPG- -SFLD-YPSTEPQPGPEL PPGDGALEPETPM 54 9 

Qy 940 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984 

II | = : | : : : H :||||||| 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 IKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1044 

Ihlhilh I II III Ihlll Ihlll II llilllll hlhlhllllll 

Db 598 IKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPF 657 

Qy 104 5 RAYLESEVAI SEELVQKYSNSALGHWSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

HIM h :hl Hlh: MllhHIIIIIII IIIIIIIIMMh Mill 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLILALISLFSI PVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

I I I I I I I I - h ^ h I - I h I HI III MM : MIMMIII II h 

Db 718 LFNGLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKI QAKI PGAKRHAE 77 6 



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

AP PLI CANT ; Bandman , 01 ga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 



OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 356 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 3073 09 
US-08-700-607-6 

Query Match 12.3%; Score 716.5; DB 2; Length 356; 

Best Local Similarity 46.1%; Pred. No. 4.6e-38; 

Matches 164; Conservative 48; Mismatches 89; Indels 55; Gaps 7; 

Qy 828 KESETFSDSSPI EI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI AN I QS GADS 882 

= 1 I M hi I M -Ml := =: | I = I I 

Db 36 EEREAELDS ELI IESCDASSASEESPKREQDSPPMKPSALDAIREETGVRAEERAPS 92 

Qy 883 LPCLELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKAS I SPSNVSALEPQTEMGS I VKSK 942 

I I II : II I I I MM I 
Db 93 RRGLAEPG- -SFLD-YPSTEPQPGPEL PPGDGALEPETPM 12 9 

Qy 943 SLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRDIKK 987 

II \:: \ : : : |:| MMMMIh 

Db 13 0 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRDIKQ 18 0 

Qy 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 1047 

MM I II 1 1 1 I hi II MM 1 1 1 1 Ml Ml hlhlhlllllhll 

Db 181 TGI VFGSFLLLLFSLTQFS WSWAYLALAALSATI SFRI YKSVLQAVQKTDEGHPFKAY 240 

Qy 1048 LESEVAI SEEL VQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

M h MM MM- MIMMIMIIMI M M II II M II h II MM 1 1 

Db 241 LELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFN 3 00 

Qy 1108 GLTLL I LAL I SLFS I P VI YERHQVQ I DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

M II |: = | = :|:|::| | M Ml III MM = MIMIMM M h 

Db 301 GLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 356 



RESULT 4 
US-08-700-607-7 

; Sequence 7, Application US/08700607 
; Patent No. 5858708 



; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2 08 amino acids 

TYPE: amino acid 

STRANDEDNESS : s ingl e 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 307311 
US-08-700-607-7 

Query Match 11.7%; Score 68 5; DB 2; Length 2 08; 

Best Local Similarity 67.0%; Pred. No. 2.2e-36; 

Matches 128; Conservative 32; Mismatches 31; Indels 0; Gaps 0; 
Qy 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLSVTISFRI YKGVI 1032 





Db 



18 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRI YKSVL 7 



Qy 



1033 QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 10 92 



i i " i I * i r i i i i • I I I I I * I ' I -Ml-- : M M : I M |[ M M llllllll 

78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 13 





Db 



Qy 




Db 



Qy 1153 AKI PGLKRKAD 1163 

Mill II h 

Db 198 AKI PGAKRHAE 2 08 



RESULT 5 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER : US/ 08/700 , 607 
FILING DATE: Filed Herewith 
ATTORNEY /AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2 67 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 

Query Match 11.5%; Score 671; DB 2; Length 2 67; 

Best Local Similarity 66.3%; Pred. No. 2.4e-35; 

Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0; 
QY 973 KTS WDLLYWRD I KKTG WFGASLFLLLSLTVFS I VS VTAY I ALALLS VTI S FR I YKGVI 1032 

h - 1 1 1 1 1 M I H M | , . I II III Ihlll Ihlll II III 

Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 68 



Qy 



1033 



QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 1092 





Db 



69 



QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 12 



Qy 



1093 




Db 



129 



Qy 



1153 



AKIPGLK 115 9 

Mill : 
AKIPGAR 195 



Db 



189 



RESULT 6 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 186 Human Secreted proteins 
FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/09/ 14 9 , 476 

CURRENT FILING DATE: 1998-09-08 
; EARLIER APPLICATION NUMBER: PCT/US98 / 04493 
; EARLIER FILING DATE: 1998-03-06 
; EARLIER APPLICATION NUMBER: 60/040,162 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038 , 621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 

EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,336 

EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,502 
; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,617 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/047,618 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,503 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 6 0/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056 , 889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08*22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 



; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; • EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,578 
; EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/043,576 

EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/047,501 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,670 
; EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/056,632 
; EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,664 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,876 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,881 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,909 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,875 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,862 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,887 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,908 
; EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/048,964 
; EARLIER FILING DATE: 1997-06-06 

EARLIER APPLICATION NUMBER: 60/057,650 

EARLIER FILING DATE: 1997-09-05 

EARLIER APPLICATION NUMBER: 60/056,884 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/057,669 

EARLIER FILING DATE: 1997-09-05 

EARLIER APPLICATION NUMBER: 60/049,610 
; EARLIER FILING DATE: 1997-06-13 

EARLIER APPLICATION NUMBER: 60/061,060 
; EARLIER FILING DATE: 1997-10-02 

Query Match 9.2%; Score 539.5; DB 4; Length 168; ' 

Best Local Similarity 59.3%; Pred. No. 3.1e-27 ; 

Matches 99; Conservative 36; Mismatches 31; Indels 1; Gaps 1; 

Qy 998 LLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI QAIQKSDEGHPFRAYLESEVAI SEE 1057 

: I I I I M- M =|: IIIIIIIIIIIMI lllhllhl III hUh := =| I 
Db 1 MLLSLAAFSVI S WS YLI LALLSVTI SFRI YKSVI QAVQKSEEGHPFKAYLDVDITLSSE 60 

Qy 1058 LVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 1117 

I hh hi H : llllhlllllll II lh llllhllhllllll = 
Db 61 AFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAEL 120 



Qy 1118 SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGL - KRKAD 1163 

:|| = |::||::: llilhhl I : llllhlh hlh 

Db 121 LIFSVPI VYEKYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 167 



RESULT 7 
US-08-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 /700 , 6 07 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,74 9 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-84 5-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 amino acids 
TYPE: amino acid 
STRANDEDNESS : s irigl e 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-3 

Query Match 8.9%; Score 519; DB 2; Length 241; 

Best Local Similarity 59.4%; Pred. No. l.le-25; 

Matches 95; Conservative 33; Mismatches 32; Indels 0; Gaps 0; 

Qy 972 SKTSV^LLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031 

I : I Ih llhllll Ml H HIM lh:|| :|: I I I I I I I I I I I | | | | 



Db 



44 SSCAVHDLI XWRDVKKTGFVFGTTLIMLLSLAAFSVI S WSYLI LALLSVTI SFRI YKSV 103 



Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091 

U M I I : :| | |:|: |:| : | : | | | | | : | | | | | | | 

Db 104 I QAVQKS EEGH PFKA YLD VD I TLS S EAFHN YMNAAM VH I NRALKL 1 1 RLFLVEDLVDS LK 163 

Qy 1092 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVI YERHQV 1131 

I! Ih llllhllhlMIM : : hh:| 
Db 164 LAVFMWLMTYVGAVFNGI TLLI LAELLI XSVPI VYXKYKV 203 



RESULT 8 

US-08-714-741-32 

Sequence 32, Application US/08714741 
Patent No. 6500613 
GENERAL INFORMATION: 

APPLICANT: Briles, David E. 
APPLICANT: McDaniel, Larry S. 
APPLICANT: Swiatlo, Edwin 
APPLICANT: Yother, Janet 
APPLICANT: Crain, Marilyn J. 
APPLICANT: Holl ingshead , Susan 
APPLICANT: Tart, Rebecca 
APPLICANT: Brooks -Walter , Alexis 

TITLE OF INVENTION: PNEUMOCOCCAL GENES, PORTIONS THEREOF, 

TITLE OF INVENTION: EXPRESSION PRODUCTS THEREFROM, AND USES OF SUCH GENES , 
TITLE OF INVENTION: PORTIONS AND PRODUCTS 
NUMBER OF SEQUENCES: 4 7 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Curtis, Morris & Safford, P.C. 
STREET: 53 0 Fifth Avenue 
CITY: New York 
STATE: New York 
COUNTRY : U.S. 
ZIP: 10036 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 /714 , 74 1 
FILING DATE: 16-SEP-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Frommer Esq., William S. 
REGISTRATION NUMBER: 2 5,506 
REFERENCE/DOCKET NUMBER: 454312-24 6 0 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 840-3333 
TELEFAX: (212) 840-0712 
INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 8 991 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



MOLECULE TYPE: amino acid 
US-08-714-741-32 



Query Match 5.2%; Score 3 02.5; DB 4; Length 8991; 

Best Local Similarity 22.1%; Pred. No. 1.4e-09; 

Matches 244; Conservative 125; Mismatches 439; Indels 297; Gaps 47; 

Qy 18 PRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAP 77 

hi II ■ ■ I I III Mil:: II I I 

Db 7805 PKPAPAPQPAPAPKPEKTDDQQAEEDYARRSE- -EEYNRLPQQQ PPKAEKP 7853 

Qy 78 LLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPP 137 

: I I l = M ! I I I I I II I I I I I I 

Db 7854 APAPKPEQPVPAEXPENP APAPK- - PAXAPQPLKPEEPAEQP 78 93 

Qy 138 ARPPPPPPAGASPLAEPAAPPSTPAAP- -KRRGSGSVDETLFALPAASEPVIPSSAEKIM 195 

I III II I I III: I : I I : I I 

Db 7894 KPEKPEEPAGQ PEPEKPDDQQAGEDYARRSGGEYNRFPQQQPPKAEK- -PAPAPK- - 7946 

Qy 196 DLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSF KEHGYLGNLSAVSS 248 

Ml I I h I : I =| I I II : 

Db 7947 --PEQP VPAPKTLLKKAKLAGAKSKAATKKAELEPELEKAEAELENLLSTLD 79 96 

Qy 249 S EGT I E ETLNEAS KE L P E RATN P F VNRDLAE FSEL--EYS EMGS S F KG S P KG E S A I L VEN 306 

II =1 =1 II I I I : III | | = = : | : 

Db 7997. PEG KTQDELDKEAAEAELNKKVEALPNQVSELEEELSKLEDNLKDAETNNVEDYIKE 8053 

Qy 3 07 TKEEVIVRSK DKEDLVCSAALHS PQES P VGKEDRWS PEKTMD I FNEMQMS WAP VR 3 63 

II I : :| llh - = I I h II II 

Db ■ 8054 GLEEAIATKQAELEKTPKELDAALN- -ELGPDGDEEETPPPE APAE 8097 

Qy 364 EEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED--SLEQKSLGKDSEGRN 421 

■ : I I I : I I I I I h h I h I : 

Db 8098 QPKPE-KPAEETPAPAPKPEKSADQQA EEDYARRSEEEYNRLTQQQPPKAEKPAP 8151 

Qy 422 EDASFPSTPEPVKDSSRAYITCASFTSA TESTTANTFPLLEDHT 465 

I I I I II I I I I II 

Db 8152 APAPKPEQPAPA- PKSRGLATKKKLNLAEARI ELLLKKLGLEPGLEKAGAGLGNLLSTLD 8210 

QY 4 66 SENKTDE KKIEERKAQI - - ITEKTSPKTSNPFLVAVQDSEADYVTTDT 511 

I II : Ihl I : : I : I I -hi : = | 

Db 8211 PEGKTQDELDKEAAEAELNKKVEALPNQVAELEEELSKLEDN LKDAETNH VEDY I 8265 

Qy 512 LSKVTEAAVSNMPE --GLTPDLVQE ACESELNEATGTKIAYET 552 

I III | | II 

Db 8266 KEGLEEAIATKQAELEKTPKELDAALNELGPDGDEEETPAPEAPAEQPKPEKPAEETPAP 8325 

Qy 553 KVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI VMEAP 596 

: I : I I , I I I I h hi h II 

Db 8326 APKPEKSADQQAEEDYARRSEEEYNRL- -TQQQPPKAEKPAPAPAP-KPEQPAPAPKKKQ 8382 

Qy 597 LNSLLPSA GASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAM 641 

l-l I lh:: III : I I I II : 

Db 8383 KVNLENLLSTLDPGGKTQDELDKGAAEAELNKKVEALPNPVX ELEEELSPPEDN- - 8436 

Qy 642 NVALKALGT KEGIKE PESFNAAV QETEAPYIS 673 



Db 



8437 - 



- -LKDAETNHVEDYIKEGLEEAIATKQAELEETPQEVDAALNDLVPDGGEEETPAP- - 



- 8490 



Qy 674 IACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE LVEDSSPESE 72 0 

: : : |:|: : | ||| : || | : |::| 

Db 84 91 APQPDEPAPAPAPNAEQPAPAPKPEKSADQQAEEDYARRSEGEYNRLTQQQPPKAE 8 54 6 

Qy 721 - PVDLFSDDS I PEVPQTQEEAVML MKESLTEVSETVAQHKEERLSASPQ 768 

I I =1 I ::: : | | :|: |: : | 

Db 8 547 KPAPAPAPKPEQPAPAPNKEI ARLQSDLKDAEENNVEDYI KEGLEQAI TNKKAELATTQQ 8 606 

Qy 769 ELGKPYLESFQPNLHSTKDAASNDI PTLTKKE KISLQMEEFNTAI YSNDDLL 820 

: | : | | h | h | ::: ::| = :: | 

Db 8607 NIDKTQKDLEDAELELEKVLATLDPEGKTQDELDKEAAEAELNEKVEALQNQVAELEEEL 8666 

Qy 821 SS KEDKI KES ETFSDS S P I - E 1 1 DEFPTFVSAKDDS PKLAKE YTDLEVSDKSE I AN I QSG 879 

Db 8667 SKLEDNLKDAETNNVEDYIKEGLEE AIATKKAELEKTQKE 8706 

Qy 880 ADSLPCLELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSI V 939 

I = : I : =| h II I hi 
Db 8707 LDAALNELGPDGD EEETPAPAPQPEKPAEEPEN-PAPAPKPE 8747 

Qy 94 0 KSKSLTKEAEKKLPSDTEKEDRSLS 964 

II -Ih :hl h 

Db 874 8 - -KSADQQAEEDYARRSEEEYNRLT 877 0 



RESULT 9 

US-09-149-476-411 

; Sequence 411, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 186 Human Secreted proteins 

; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/14 9,476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/ 044 93 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,626 

EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,334 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/04 0,336 

; EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,163 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/047,600 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,615 

; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/04 7,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 



EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER : 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,8 93 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/04 7,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 



; EARLIER FILING DATE: 1997-10-02 

Query Match 4.9%; Score 286; DB 4; Length 92; 

Best Local Similarity 56.0%; Pred. No. 2e-ll; 

Matches 51; Conservative 21; Mismatches 19; Indels 0; Gaps 0; 

Qy 1064 NSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I P 1123 

hh hi H : I I ! I I - I I ! I I I II lh llllhlihllllM : HM 
Db 2 NAAMVHI NRALKLI I RLFLVEDLVDSLKLAVFMWLMTYVGAVFNGI TLLI LAELLI FSVP 61 

Qy 1124 VIYERHQVQIDHYLGLANKSVKDAMAKIQAK 1154 

-•lh- II II hi : I I = I I =| 

Db 62 IVYEKYKTQIDHYVGIARDQTKSIVEKI PSK 92 



RESULT 10 
US-08-973-462-8 

; Sequence 8, Application US/08973462B 

; Patent No. 6191270 

; GENERAL INFORMATION: 

; APPLICANT: DRUILHE , PIERRE 

; APPLICANT: DAUBERS I ES , PIERRE 

; TITLE OF INVENTION: MALARIAL PRE - ERYTHROCYTI C STAGE POLYPEPTIDE MOLECULES 

; FILE REFERENCE: 0660-0125-0 PCT 

; CURRENT APPLICATION NUMBER: US/08/973 , 462B 

CURRENT FILING DATE: 1998-02-06 
; EARLIER APPLICATION NUMBER: PCT/FR96/ 00894 

EARLIER FILING DATE: 1996-06-12 

EARLIER APPLICATION NUMBER: FR 95/07007 
; EARLIER FILING DATE: 1995-06-13 
; NUMBER OF SEQ ID NOS : 2 9 

SOFTWARE: PatentlnVer. 2.0 
; SEQ ID NO 8 

LENGTH: 1786 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence : Polypeptide 
US-08-973-462-8 

Query Match 4.8%; Score 279.5; DB 3; Length 1786; 

Best Local Similarity 20.1%; Pred. No. 3.9e-09; 

Matches 261; Conservative 220; Mismatches 518; Indels 297; Gaps 60; 

Qy 33 EDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDS VP 87 

I h II III :|::|| I I I h : I :| I || | 

Db 225 EKVEESVEENDEESVEENVEE -NVEENDDG SVASS VEES I ASS VDES I DSS I EENVA 280 

Qy 88 P APRGPLPAAPPAAPERQPSWERSPA APAPSLPPAAAV-LPSKLPEDDE 135 

I II II : : I I I : | :: : | : : |: | 

Db 281 PTVEEI VAPSWESVAPSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAENVE 34 0 

Qy 136 PPARP PPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPS 189 

I I H : II : : II I : = =| I I 

Db 341 EI VAPTVEEIVAPTVEEI VAPSWESVAPSVEESVEENVEESVAENV- -EESVAENVEES 3 98 



Qy 



190 SAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASL- 



-PSLSPLSTVSFKEHGYLGNLSAVS 24 7 



Db 


399 


VAENVEESVAE- - NVEESVAENVEEI VAPTVEEI VAPTVEEI VAPSWE SVAP 


449 


Qy 


248 


SSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENT 


307 


Db 


450 


1 1 -II : |: | | : | :|| | || : || 

SVEESVEENVEESVAENVEESVAENVEESVAENVE ES--VAENV 


491 


Qy 


308 


KEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYA 367 
• II . . . 1 . 1 -.11 ..l.l I . . .ill M 


Db 


492 


*ii • * * I • l --II * • 1 ■ 1 1 ■ • -III II. 
EESV AENVEESVA ENVEESVAENVEEIVAP TVE EI VAPTVEEI V 


535 


Qy 


368 


DFKPFEQ-AWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGRN-EDAS 
1 1 h = : 1 = = i III 1 :|:|: : -1 1 h 


425 


Db 


536 


APS WES VAPS VEES VEENVEES VAE - NVEES VAEN - VEES VAENV- - EES VAENVEE I V 


591 


Qy 


426 


FPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKAQI ITE 


485 


Db 


592 


h 1 1 = : IN | =|: || :|| |: = | 
APTVEEI VAPTVEEI VA PSWESVA PSVEESVEEN VEESVAENVEE 


637 


Qy 


486 


KTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATG 


545 


Db 


638 


SVAENVEESVAENVEEI VAPTVEEI VAPTVEEI VAPS WES VAPS -VEES VEENVEES VA 


696 


Qy 


546 


TKIAYETKVDLVQTS- -EAIQESLYPTAQ- -LCPSFEEAEATPSPVLPDI VMEAPLNSLL 


601 


Db 


697 


■ : \: : \: \ \ ::||: : : |: || | | : :|| : : |: 

ENVE-ESVAENVEESVAENVEESVAENVEEI VAPTVEEI VA PTVEEIVAPSWESVA 


752 


Qy 


602 


PSAGASV VQPSVSPLEAPPPVSYDSIKLE 


630 


Db 


753 


M M 1 hi : II :| 

P S VE E S VE EN VE E S VAEN VE E S VAEN VEES VAEN VEESVAPTVEE I VAP SVE 


804 


Qy 


631 


PENPPPYEEAM NVALKALGTKEGI KEPESFNAAVQETEAPYI S IACDLI K 


680 


Db 


805 


ESVAPS VEESVAENVATNLSDNLLSNLLGGI ETEEIKDS I LNE I EEVKENWTTI LENVE 


864 


Qy 


681 


ETKLSTEPS PDFSNYSE I AK FEKSVPEHAELV -EDSSPESEPVDLF 


725 


Db 


865 


11=1 INI: h : I | | |:: | | :: 

ET- -TAESVTTFSNILEEIQENTITNDTIEEKLEELHENVLSAALENTQSEEEKKEVIDV 


922 


Qy 


726 


SDDSI PEVPQTQEEAVMLMKESLTE VSETV 

1 ..1 1. . II . 1 . . 1 | || 


755 


Db 


923 


1 ■ ■ 1 1 - ■ II * 1 • * 1 1 II 
I EEVKEEVATTLI ETVEQAEEKSANTI TEI FENLEENAVESNENVAENLEKLNETVFNTV 


982 


Qy 


756 


AQHKEERLSASPQELGKPYLES- -FQPNLHSTKDAASNDI PTLTK- - KEKI SLQMEE - - - 


808 


Db 


983 


11= 1 = 1 :: 1 =1 I ■ ■ ■ ■ Mil 

LDKVEETVE I SGESLENNEMDKAFFSEI FDNVKGI QENLLTGMFRS I ETS I VI QSEEKVD 


1042 


Qy 


809 


FNTAI YSN - - DDLLSSKEDKI KESETFSDSS PI EIIDEF - PTFVS 


850 


Db 


1043 


1 = h h= : II : : I | : : | : | |: 

LNENWSS I LDN I ENMKEGLLNKLENI SSTEGVQETVTEHVEQNVYVDVDVPAMKDQFLG 


1102 


Qy 


851 


AKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEVHVSDEFS 


910 


Db 


1103 


1 = = =11 III :| : h : 1 : h 1 : : 1 

I LNEAGGLKEMFFNLEDVFKSE SDVI TVEE I KDEP VQKEV- EKETVS 1 1 EEME 


1154 


Qy 


911 


ENRSSV SKAS I SPSNVSALEPQTEMGS I VKSKSLT KEAEKKLPSDTEK EDR 


961 



Db 



1155 EN I VDVLEEEKEDLTDKM I DAVEES I E I S SDSKEETES I KDKEKDVSL WEE VQDNDMDE 1214 



Db 



Qy 



962 SLSAVL SAELSKTSV-VDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 1015 

h II II I :| - = : |::| : |: : 
1215 SVEKVLELKNMEEELMKDAVEINDITSKLIEETQELNEVEADLIKDME K 1263 



Qy 



1016 LALLSVTI SFRI YKGVI QA IQK-SDEGHPFRAYLESEVA ISEELVQKYSN- -S 1065 

II : I I : I I ::| :| I h I : h ::| h 

12 64 LKELEKALS - EDSKE 1 1 DAKDDTLEKVI EEEHDI TTTLDEWELKDVEEDKI EKVSDLKD 1322 



Db 



Qy 



1066 ALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVI 1125 



Db 



1323 LEEDILKEVKEIKE- -LESEILEDYK 



ELKTIETDIL 1356 



Qy 



1126 YERHQVQIDHYLGLANKS- -VKDAMAKIQAKIPGLK 1159 

h lh - : I I I I - h 

1357 EEKKEI EKDHFEKFEEEAEEI KDLEADI LKEVSSLE 1392 



Db 



RESULT 11 
US-08-769-309A-5 

/Sequence 5, Application US/08769309A 

; Patent No. 5741890 

; GENERAL INFORMATION: 

APPLICANT: Scott, John D., 

APPLICANT: Nauert , Brian J., 

APPLICANT: Klauck, Theresa M. 

TITLE OF INVENTION: Protein Binding Domains of Gravin 
NUMBER OF SEQUENCES : 24 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun 

STREET: 63 0 0 Sears Tower/233 South Wacker Drive 

CITY: Chicago 

STATE: Illinois 

COUNTRY: United States of America 

ZIP: 60606-6402 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/769 , 3 09A 

FILING DATE: 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: No. 5741890and, Greta E. 

REGISTRATION NUMBER : 35,3 02 

REFERENCE/DOCKET NUMBER: 27866/33451 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 312-474-6300 

TELEFAX: 312-474-0448 

TELEX: 25-3856 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 178 0 amino acids 

TYPE: amino acid 



TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-769-309A-5 

Query Match 4.5%; Score 265.5; DB 1; Length 178 0; 

Best Local Similarity 20.8%; Pred. No. 3e-08; 

Matches 264; Conservative 146; Mismatches 486; Indels 371; Gaps 51; 

Qy 11 SSSTDSPPRPPPA FKYQFVTE PEDE EDEEEEEDEEED 47 

I I = I I I : I I : I I : III I -hi h I 

Db 277 SKSAESPTSPVTSETGSTFK-KFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVD 335 

QY 48 DEDLEELEVLERKPAAGLSAAAVPPAAAA- - PLLDFSSDSVPPAPRGPLPAAPPAAPERQ 105 

h = M I I I II :| I I : | |h I I 

Db 336 TEEDGKAEVASEKLTASEQAHPQEPAESAHEPRLSAEYEKV -ELPS EEQ 383 

Qy 106 PSWERSPAAPAPSLPPAAAVLPSKLP EDDEPPARPPPPPPAGASP 150 

I : I : i : I I I h h : |h I 

Db 384 VSGSQGPSEEKPA-PLATEVFDEKIEVHQEEWAEVHVSTVEERTEEQKTEVEETAGSVP 442 

Qy 151 L AEP- -AAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQP 201 

III I I I I | : : : | 

Db 443 AEELVGMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGWSEVEML 502 

Qy 202 GNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLG NLSAVSSSEGTI 253 

: I I l : I II hi I : : | : 

Db 503 S SQERMKVQGS PLKKLFTSTGLKKLS GKKQKGKRGGGDEESGEHTQVPADSPDSQ 557 

Qy 254 EETLNEASKELPERATN-PFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVI 312 

II hi II : : III : :| | = = I I I = |= | 

Db 558 EEQKGESSASSPEEPEEITCLEKGLAEVQQDGEAEEGATSDGEKKREGVTPWASFKKMVT 617 

Qy 313 VRSK DKE DLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWAPV 3 62 

III I I II I I : : :: ||: || | 

Db 618 PKKRVRRPSESDKEDELDKVKSATLSSTEST ASEMQEEMKGSVEEPK 664 

Qy 3 63 REEYADFKPFEQAWEV KDTYEGS 38 5 

M : 1 1 II h 

Db 665 PEEPKRKVDTSVSWEALICVGSSKKRARRRSSSDEEGGPKAMGGDHQKADEAGKDKETGT 724 

Qy 386 RDVLA ARANVESK VDRKCLEDSLEQKSLGKDS 417 

HI | : | : || : ||:|| :|| 

Db 725 DGILAGSQEHDPGQGSSSPEQAGSPTEGEGVSTWESFKRLVTPRKKSKSKLEEKS--EDS 782 

Qy 418 -EGRNEDASFPSTPEPVKDSS RAYITCASFTSATE- -ST 453 

Db 783 IAGSGVEHSTPDT-EPGKEESWVSIKKFIPGRRKKRPDGKQEQAPVEDAGPTgJ^EDDSD 841 

Qy 454 TANTFPLLEDHTSENKTDE KKIEERKAQIITEKTSPKTSNPFLVAVQD 501 

III I : I :: h= I :::: I : || I 

Db 842 VPAWPLSEYDAVEREKMEAQQAQKGAEQPEQKAATEVSKELSESQVHMMAAAVADGTRA 901 

Qy 502 S EADY VTTDTLS K VTEAAVSNMPEG- 526 

= 11 =1 : I = III HI 

Db 902 ATI IEERSPSWISASVTEPLEQVEAEAALLTEEVLEREVIAEEEPPTVTEP LPENR 957 

Qy 527 LTPDLVQEA-CESELNEATGTKI -AYETKVDLVQTSEAIQESLYPTAQ 572 



Db 958 EARGDTWSEAELTPEAVTAAETAGPLGSEEGTEASAAEETTEMVSAVSQLTDSPDTTEE 1017 

Qy 573 L CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVS 623 

I IN M : : | || | | : || 

Db 1018 ATPVQEVEGGVPDIEEQERRTQEVLQAVAEKVKEESQLPGTG GPEDVLQPVQ 1069 

Qy 624 YDSIKLEPENPPPYEEA MNVALK ALGTKEGI KEPESFNAAVQE 666 

MM II I || 1 1 1 1 | | 

Db 1070 RAEAERPEEQAEASGLKKETDWLKVDAQEAKTEPFTQGKWGQTTPESFEKAPQV 1125 

Qy 667 TEAPYI SIACDLI KETKLSTEPSPD- -FSNYSEIAKFEKSVPEHAELVEDSSPES - - EPV 722 

M: h -I I : h |:::| hi | 

Db 1126 TES IESSELVTTCQAETLAGVKSQEMVMEQAIP PDSVETPT 1166 

Qy 723 DLFSDDSIP EVPQTQEEAVMLMKESLTEVSETVAQH- KEERLSASPQELGKPY 774 

I : I I I : I I : | : : | : | | : | | : :| 

Db 1167 DSETDGSTPVADFDAPGTTQ KDEI VEIHEENEVHLVPVRGTEAEAVPAQKERPP 1220 

Qy 775 LESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFS 834 

Db 1221 APSSFVFQEETKE QSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYA 1269 

Qy 835 DSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885 

Db 1270 DEKTKDVPFFEGLEGSIDTGITVSREKVTEV 1329 

Qy 886 LELPCDLSF KNI YPKDEVHVSDEFSENRSS VSKASISPSNVSALEPQTE 934 

I : : I I - I h - III : || :: | 

Db 1330 PVEREMVVQVEREKTEAEPTHVNEEKLEHETAVTVSEEVSKQLLQTVNVPI I DGAKE 1386 

Qy 935 MGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDI KKTGWF - - 992 

= I: I |: : : : : | |:| : | | :| :|| 

Db 1387 VSSLEGSPPPCLGQEEAVCTKIQVQSSEASFTLTAAAEEEKV- -LGETANILETGETLEP 1444 

Qy 993 -GASLFL 9 98 

II I I 

Db 144 5 AGAHLVL 14 51 



RESULT 12 
US-08-994-570-5 

; Sequence 5, Application US/08994570 

; Patent No. 6090929 

; GENERAL INFORMATION: 

APPLICANT: Scott, John D., 

APPLICANT: Nauert , Brian J, , 

APPLICANT: Klauck, Theresa M . 

TITLE OF INVENTION: Protein Binding Domains of Gravin 
NUMBER OF SEQUENCES: 24 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, 0' Toole, Gerstein, Murray & Borun 

STREET: 63 00 Sears Tower/233 South Wacker Drive 

CITY: Chicago 

STATE: Illinois 

COUNTRY: United States of America 
ZIP: 60606-6402 



COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/994,570 

FILING DATE: 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: No. 6090929and, Greta E. 

REGISTRATION NUMBER: 35,302 

REFERENCE/DOCKET NUMBER: 27866/33451 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 312-474-6300 

TELEFAX: 312-474-0448 

TELEX : 25-3856 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 178 0 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-994-570-5 

Query Match 4.5%; Score 265.5; DB 3; Length 178 0; 

Best Local Similarity 20.8%; Pred. No. 3e-08; 

Matches 264; Conservative 146; Mismatches 486; Indels 371; Gaps 51; 

Qy 11 SSSTDSPPRPPPA FKYQFVTE PEDE EDEEEEEDEEED 47 

I I =11 I = II =1 h IN I ::|:| h | 

Db 277 SKSAESPTSPVTSETGSTFK-KFFTQGWAGWRKKTSFRKPKEDEVEASEKKKEQEPEKVD 335 

QY 4 8 DEDLEELEVLERKPAAGLSAAA.VPPAAAA--PLLDFSSDSVPPAPRGPLPAAPPAAPERQ 105 

h : II II I || :| | | : | ||: i I 

Db 336 TEEDGKAEVASEKLTASEQAHPQEPAESAHEPRLSAEYEKV ELPS EEQ 3 83 

Qy 106 PSWERSPAAPAPSLPPAAAVLPSKLP EDDEPPARPPPPPPAGASP 150 

I : h h I I I h h : ||: | 

Db 384 VSGSQGPSEEKPA-PLATEVFDEKIEVHQEEWAEVHVSTVEERTEEQKTEVEETAGSVP 442 

QY 151 L AEP- -AAPPSTPAAPKRRGSGSVDETLFALPAASEPVT PSSAEKIMDLMEQP 201 

III I I I II I ■■ I 1= I - :| 

Db 443 AEELVGMDAEPQEAEPAKELVKLKETCVSGEDPTQGADLSPDEKVLSKPPEGWSEVEML 502 

QY 2 02 GNTVSSGQEDFPS VLLETAASLPSLSPLSTVSFKEHGYLG NLSAVSSSEGTI 253 

: I I I ' I I I hi I : : | : 

Db 503 SSQERMKVQGSPLKKLFTSTGLKKLS GKKQKGKRGGGDEESGEHTQVPADSPDSQ 557 

Qy 254 EETLNEASKELPERATN-PFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVI 312 

II hi II -III : :| h= I I I : h | 

Db 558 EEQKGESSASSPEEPEEITCLEKGLAEVQQDGEAEEGATSDGEKKREGVTPWASFKKMVT 617 

Qy 313 VRSK DKE DLVCSAALHSPQESPVGKEDRWSPEKTMDI FNEMQMSWAPV 362 

III I I II I I = : :: Ih II I 

Db 618 PKKRVRRPSESDKEDELDKVKSATLSSTEST ASEMQEEMKGSVEEPK 664 



Ov 


363 


REEYADFKPFEQAWEV - - KDTYEGS 


— > O -J 


Db 


665 


II =11 II h 

PEEPKRKVDTS VSWEATiT rvn^^KKRARRR^^^nFEnnPKAMnnnHOK'AnPAriK'nK'F'TnT 


724 


Ov 


386 


RDVLA- ARANVF^K - \/T)RKT , T,F i nQT,PnK'C3T,nK'nc: 
: 1 1 1 : 1 : II : Ihll Ml 


417 


Db 


725 


DGILAGSOEHDPnOfi^^^PFOAn^PTFnRnv^TWPCIFKRTATTPRK'KCIK'QT^T FFK"Q _T7nc; 


7ft 9 


Ov 


418 


-EGRNEDASFPSTPEPVKDSS RAYTTnA^FT^ATF- -qT 


4 R 7 

^± ZJ j 


Db 


783 


1 : 1 II II h 1 I 

I AGSGVEH^TPDT-FPGKFF^WV^T KKRT PCIRRKKR PnnKDFnAPVRnarPTraMPnncin 


Q A 1 
oil 


Ov 


454 


TANTFPLLEDHTSENKTDE KKTRFRKAOT TTRKT^PfCT^NPFT.VAVnn 




Db 


842 


VPAWPLS E YDA VER E KM EAQQAQ KGAEQ P E QKAAT E VS KE LS ES QVHMMAAAVADGTRA 


ZJ \J X. 


Ov 


502 


O \-it\LJ X v X IJJlUiJ IV V X "'nn V OiN 11 £r CAj 




Db 


902 


■■ 1 1 =1 = 1 = III HI 

ATI I EER^P^WT ^A^VTEPTiFOVFAFAATiT .TPFVT ,FPFVTAPPPPPT\/TFP T.PFMP 

■f*. x x iJjLii\jr uni Ort-O v xjZjxtxjxj^v DxlCirlriXJij X L-i Hi V XjLj is. Hi V X X_i Hi tr r 1 V X Hi xT XJ t ILXNrC 


JO 1 


Ov 


527 


LTPDTiVOEA rF^FTiMFATGTKT - AYFTKVnT.VOT^FA TOP^T-YPTAO 


D / Z 


Db 


958 


llh II 1 ||: | | ::| : :| | : 

EARGDT WS EAELTP EAVTAAETAG PLGS E EGTEAS AAE ETTEMVS AVSQLTDS PDTTEE 


i ni 7 


Qv 


573 


L CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVS 


623 


Db 


1018 


1 Ml II = = 1 II 1 1 : || 
ATPVQEVEGGVPDIEEQERRTQEVIjQAVAEKVKEESQLPGTG GPEDVLQPVQ 


1069 


Ov 


624 


YDSIKLEPENPPPYEEA rMNVALK ALGTKEGI KEPESFNAAVQE 

. f 1 1 II .III 1 1 1 1 1 1 1 1 


666 


Db 


1070 


- ill ll -ill 1 1 1 1 1 1 1 1 
RAEAERPEEQAEASGLKKETDVVLKVl^AQEAKTEPFTQGKVVGQTTPESFEKAPQV 1125 


Qy 


667 


TEAP YI S I ACDLI KETKLSTEPS PD - - FSNYSEI AKFEKS VPEHAELVEDSS PES - - EPV 


722 


Db 


1126 


lh h -i 1 : h |:::| hi | 
TES 1 ESSELVTTCQAETLAGVKSQEMVMEQAI P PDSVETPT 


1166 


Qy 


723 


DLFSDDS IP E VPQTQE EA VMLMKES LTE VS ETVAQH KEERLSAS PQELGKPY 

1 : 1 1 1 : 1 1 : h : h | I : | | = :| 


774 


Db 


1167 


DS ETDGSTP VADFDAPGTTQ KDE I VE I HEENE VHLVP VRGTEAEAVPAQKERP P 


1220 


Qy 


775 


LESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFS 


834 


Db 


1221 


1 lh : |: :| : : : || : :|:: :: 

APSSFVFQEETKE QSKMEDTLEHTDKEVSVETVSILSKTEGTQEADQYA 


1269 


Ov 


835 


DSSPIEI 1 DEF PTF VSAKDDS PKIiAKE YTDLEVSDKS E I AN I QSGADSLPC 

1 II 1 1 llh 1 : :|| I I I 


885 


Db 


1270 


DEKTKDVPFFEGLEGSIDTGITVSREKVTEVALKGEGTEEAECKKDDALELQSHAKSPPS 


1329 


Ov 


886 


LELPCDLSF KNI YPKDEVHVSDEFSENRSS VSKAS I SPSNVSALEPQTE 


934 


Db 


1330 


1 ■ 1 1 - 1 h - IN = II - 1 
PVEREMWQVEREKTE^EPTHVNEEKLEHETAVTVSEEVSKQLLQTVNVPI IDGAKE 


1386 


Qy 


935 


MGSIVKSKSLTKE^EKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWR'DIKKTGVVF-- 


992 


Db 


1387 


= 1= 1 |: : : : : | |:| = | | :| :|| 
VSSLEGSPPPCLGQEEAVCTKIQVQSSEASFTLTAAAEEEKV- -LGETANILETGETLEP 


1444 


Qy 


993 


-GASLFL 998 





II I I 

Db 1445 AGAHLVL 1451 



RESULT 13 

US-09-134-001C-4463 

; Sequence 4463, Application US/09134001C 
; Patent No. 6380370 
; GENERAL INFORMATION: 

; APPLICANT: Lynn Doucette-Stamm et al 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
STAPHYLOCOCCUS 

; TITLE OF INVENTION: EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC-007 

; CURRENT APPLICATION NUMBER: US/09/134 , 001C 

CURRENT FILING DATE: 1998-08-13 
; PRIOR APPLICATION NUMBER: US 60/064 ,964 
; PRIOR FILING DATE: 1997-11-08 
; PRIOR APPLICATION NUMBER: US 60/055,779 

PRIOR FILING DATE: 1997-08-14 
; NUMBER OF SEQ ID NOS : 5674 
; SEQ ID NO 4463 
LENGTH: 213 7 
TYPE : PRT 

ORGANISM: Staphylococcus epidermidis 
US-09-134-001C-4463 



Query Match 4.5%; Score 265; DB 4; Length 2137; 

Best Local Similarity 19.6%; Pred. No. 4.3e-08; 

Matches 172; Conservative 141; Mismatches 451; Indels 114; Gaps 24; 



Qy 


159 


STPAAPKRRGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLE 218 
i 1 III . 1 . I.I . .1.1 . I 


Db 


990 


ii- ill * I * I - I - • | * | . | 

STSTSTSDSASTSTSE SESDSASTSLSESTSTSVSDSTSTSTSDSASMSASESE 


1043 


Qy 


219 


TAASLPSLSPLSTVSFKEHGYLGNLSAV- - - SSSEGTI EETLNEASKELPERATNPFVNR 


275 


Db 


1044 


= ■ Ml =: I |= || hi | | : | | | : 
SNSKSTSLSESTSTSLS GSTSASTSDSASTSTSESESDSTSTSLSESTSTSLSGS 


1098 


Qy 


276 


DLAEFS ELE YS EMGSS FKGS PKGESAI LVE NTKEEVI VRSKDKEDLVC 


323 


Db 


1099 


1 h 1 h : II 1 1 :| 1 : :' I 

TSASTSD SASTSTSESDSTSESTSLSESLSTSVSDSTSASTSESASTSTSESESNSA 


1155 


-Qy 


324 


SAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYE 383 
1 • 1 . . . . . 1 . . 1 .1 


Db 


1156 


I • l - - • • - | • • | - | : 

STSLSGSLSTSISDSTSTSTSDSASTSTSESESDSTSTSLSE - - -STSTSLSDSTS 


1208 


Qy 


384 


GSRDVLAARANVESKVDRKCLEDSLEQK- - -SLGKDSEGRNEDASFPSTPEPVKDSSRAY 


440 


Db 


1209 


TSTSESASTSTSES - -DSTSESTSLSESTSTSVSDSTSASTSDSASTSTSVSDSESASTS 


1266 


Qy 


441 


ITCASFTSATESTTANTFPLLEDHTSE- -NKTDEKKIEERKAQI ITEKTSPKTSNPFLVA 


498 


Db 


1267 


h : II ::||: :| III : :: : | : ::: || ||: : 
ISESLSTSVSDSTSTSTSDSASTSTSESDSTSESTSLSESISTSVSDSTSASTSDSASTS 


1326 


Qy 


499 


VQDSEADYVTTD TLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYE 


551 



h =1 I 



Db 



1327 TSESESDSASTSLSGSTSTSLSDSTSTSTSDSASTSTSESDSERASTSLSGSTSTSLSDS 13 8 6 



Qy 552 TKV DLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASV 608 

I I II =: =1 I : I I I - I II II 

Db 1387 TSTSTSDSASTSTSVSDS NSASTSLSGSLSTSVSDSTSTSTSDSASAST 1435 

Qy 609 VQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK-ALGTKEGIKEPESFNAAVQET 667 

Ml II :: : : : | : | | : :: |: 

Db 1436 SESDSERA STSLSGSTSTSISDSTSTSTSDSASTSTSVSESNSTSTSISES 1486 

Qy 668 EAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSD 727 

: : I ||||::::: :| | |: :|| 

Db 1487 LSTSVS DSTSTSTSDSASTSTSVSDSDSASTSSSESV- -STSDSESTSTSTS 1536 

Qy 728 DSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQE LGKPYLESFQ 779 

II | :: : | : | ||:: : : ||| | || 

Db 1537 DSASTSTSVSESNSTSTSLSGSTSTSVSDSTSTSTSDSASASTSESDSDSASTSSSESVS 1596 

Qy 780 PNLH STKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI KESETFS 834 

II ::|| : I :|:: : : :| lh | 

Db 1597 TSVSDSTSASTSESASTSTSVSDSNSASTSLSESTSTSLSDSTSMSTSDSASTSTSESDS 1656 

Qy 835 DSSPIEIID EFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADS 882 

IK : I I III = : : ■ I I : : =lh I 

Db 1657 DSASTSLSDSTSTSVSESTSTSTSTSVSASNSTSTSLSDSRSTSLSDSTSTSTSESGSTS 1716 

Qy 883 LPCLELPCDLSFKNI YPKDEVHVSDEFSENR- -SSVSKASISPSNVSALEPQTEMGSIVK 940 

I I = := HI I = h = hi II - : | 

Db 1717 TS - - ESDSDSASTSLSESTSTS I SDSTSTSTSDSASTSMS VSDSNRASTSLSDSTSTS VS 1774 

Qy 941 SKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWD 978 

: I I I I I I II I = III I 

Db 1775 DSTSASTSESASTSTRESESTSASTSLS-ESTSTSVSD 1811 



RESULT 14 
US-08-978-277A-4 

Sequence 4, Application US/08978277A 
Patent No. 6582956 
GENERAL INFORMATION: 

APPLICANT: Gelman, Irwin H. 

TITLE OF INVENTION: TUMOR SUPPRESSOR GENE 
NUMBER OF SEQUENCES : 2 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Brumbaugh, Graves, Donohue & Raymond 
STREET: 30 Rockefeller Plaza 
CITY: New York 
STATE : NY 
COUNTRY : USA 
ZIP: 10112-0228 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/978 , 277A 



FILING DATE: 

CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/665,401 

FILING DATE: 18-JUN-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: Clark, Richard S 

REGISTRATION NUMBER: 26,154 

REFERENCE/DOCKET NUMBER: A30558 - 165/34008 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-408-2558 
TELEFAX: 212-765-2519 
TELEX : 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1596 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
HYPOTHETICAL: NO 
; . ANTI -SENSE: NO 

FRAGMENT TYPE: internal 
ORIGINAL SOURCE: 
US-08-978-277A-4 

Query Match 4.4%; Score 254.5; DB 4; Length 1596; 

Best Local Similarity 19.9%; Pred. No. 1.3e-07; 

Matches 258; Conservative 162; Mismatches 468; Indels 409; Gaps 57; 

Qy 7 SSLVSSSTDSP PRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELE 55 

II 1 = 1 I I I =11 = | | :hl h hh h I 

Db 273 SSPVNSETTSSFKKFFTHGWAGWRKKTSFKKSKEDDLETAEKRKEQEAEKVDEEEKEKTE 332 

Qy 56 VL- - ERKPAAGLSAAAVP PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWER 110 

I = = I I I : INI: || h : : 

Db 333 PASEEQEPAEDTDQARLSADYEKVELPLEDQVGDLEASSEEKCAPLATEVFDEKMEAHQE 3 92 

Qy HI SPA APAPSLPPAAAVLPSKLPEDDE 135 

I MM I = = |:: I 

Db 3 93 WAEVHVSTVEKTEEEQGGGGEAEGGWVEGTGESLPPEKLAEPQEVPQEAEPAEELMKS 4 52 

Qy 136 PPARPPPPPPAG ASPLAEPAAPP 158 

I : "I II III = : 

Db 4 53 REMCVSGGDHTQLTDLSPEEKTLPKHPEGIVSEVEMLSSQERIKVQGSPLKKLFSSSGLK 512 

Qy 159 - -STPAAPKRRGSGSVDETLFALPAASEP V I PSSAEKIMDLMEQPG-NTVSSGQEDF 212 

I Mil : I | : : | ||: \\ | :: || :| 

Db 513 KLSGKKQKGKRGGGGDEE PGEYQHIHTESPESAD EQKGESSASSPEEPE 561 

Qy 213 PSVLLET AASLPSLSPLSTV SFKEHG 238 

- M h -I I Ml 

Db 562 ETTCLEKGPLEAPQDGEAEEGTTSDGEKKREGITPWASFKKMVTPKKRVRRPSESDKEEE 621 

Qy 239 YLGNLS A - VS S S EGT I E E TLNEASK- ELPERATNPFVNRD LAEFSELEYS 286 

II HI- hi h I I I hi = h = 

Db 622 LEKVKSATLSSTDSTVSEMQDEVKTVGEEQKPEEPKRRVDTSVSWEALICVGSSKKRARK 681 



Qy 287 EMGSSFKGSPK GESAI LVENTKE EVI VRSKDKEDLVCSAALHSPQESPVGKE 338 

I : I h hi I :|: : : I ::| II! I 

Db 682 ASSSDDEGGPRTLGGDSHRAEEASKDKEAGTDAVPASTQEQDQAQGSSSPEPAGSPSEGE 741 

Qy 33 9 DRWSP EKTMDIFNEMQMSWAPVREEYADFKPFEQAW-EVKD 380 

hhl II I I : : I III :| i 

Db 742 GVSTWESFKRLVTPRKKSKSKLEEKAEDSSVEQLSTEIEPSREE SWVSIKK 792 

Qy 381 TYEGSRDVLAARANVESKVDRKCLEDS LEQKSLGKDS 417 

Db 793 FIPGRR KKRADGKQEQATVEDSGPVEINEDDPNVPAWPLSEYNAVEREKM- -EA 845 

Qy 418 EGRNE DASFPSTPEPVKDSSRAYITCASFTSAT 450 

: I I I ■■ I |:: I ::h II I 

Db 846 QGNTELPQLLGAVYVSEELS KTLVHTVS VAVI DGTRAVT - - S VEERS PSW I S - AS VTEPL 902 

Qy 451 ESTTANTFPLLEDHTSENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTD 510 

II I :h I |||: -h I h :|: | ||:: 

Db 903 EHTAGEAMPPVEEVT EKDI IAEETPVLTQ-TLPEG KDAHDDM VTS E 947 

Qy 511 TLSKVTEAAVSNMPEGLTPDLVQEACES - ELNEATGTKI AYETKVDLVQTSEAI QESLYP 569 

I I :| II ^ h III I hi : :| 

Db 948 VDFTSEAVTATETSEALRTEEVTEASGA E ETTDMVSAVS QLTDS PDT 994 

Qy 570 TAQLCP SFEEAEATPSPVLPDI VMEAPLNSLLPSAGASWQPSVSPLEAPP 620 

I : I III :| : : I :| I :| : || 

Db 995 TEEATPVQEVESGVLDTEEEERQTQAI LQAVADKVKEESQVP - ATQTVQRTGSKALEKVE 1053 

Qy 621 PVSYDSIKLEPENPP PYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISI 674 

I II I I I :|| I I h: I III : ::: 

Db 1054 EVEEDSEVLASEKEKDVMPKGPVQEAGAEHL-AQGSETGQATPESLEVPEVTADVDHVA- 1111 

Qy 675 ACDLIKETKLSTEP-SPDFSNYSEIAKFEKSVPEHAE'LVEDSSPESEPVDLFSDDS 729 

I =11 H : :h I :: | | | : : | :| | || 

Db 1112 TCQVIKLQQLMEQAVAPESSETLTDSETNGSTPLADSDTADGTQQDETID- -SQDSKATA 1169 

Qy 730 - I PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDA 788 

: : 1 = 111 || : : | :| :| |: || | | | 
Db 1170 AVRQSQVTEEEAATAQKEEPSTLPNNVPAQEEH GEEPGRDVLEPTQQEL TA 1220 

Qy 789 ASNDI PTLTKK EKISLQMEEF NTAI YSNDDL LS 821 

h :| I I ||: : | | | :: : 

Db 1221 AA--VPVLAKTEVGQEGEVDWLDGEKVKEEQEVFVHSGPNSQKAADVTYDSEVMGVAGCQ 1278 

Qy 822 SKED KI KESETFSDSSPI EI IDEFPTFVSAKDDSPKLAKEY TDL 8 65 

II = = l I =| I : I II = = I h h 

Db 1279 EKESTEVQSLSLEEGEMETDVEK-EKRETKPEQVSEEGEQETAAPEHEGTYGKPVLTLDM 1337 

Qy 866 E VSDKS E I AN I QSGADS L P CLE 88 7 

I- = h III hi 

Db 1338 PSSERGKALGSLGGSPSLPDQDKAGCI EVQVQSLDTTVTQTAEAVEKVI ETWISETGES 1397 

Qy 888 LPCDLSFKNI YPKDEVHVSDEF SENRSSVSKAS I S PSNVSALEPQTEMG 936 

II : I II ]■■:: ::|: I I I : I 

Db 1398 PECVGAHLLPAEKSSATGGHWTLQHAEDTVPLGPESQAESI PI I VTPAPESTLHPDLQ-G 1456 



Qy 



937 S I VKS KSLTKEAEKKL PSDTEKEDRSLSAVLSAE 970 



Db 



il- ill I II * • II II 

1457 EI SASQRERSEEEDKPDAGPDADGKESTAI EKVLKAE 1493 




RESULT 15 
US-08-993-118-7 

; Sequence 7 , Application US/08993118 

; Patent No. 5997872 

; GENERAL INFORMATION: 

APPLICANT: LUCAS, Sophie; 

APPLICANT: DE SMET, Charles; 

APPLICANT: BOON-FALLEUR , Thierry 

TITLE OF INVENTION: ISOLATED NUCLEIC ACID MOLECULE CODING FOR TUMOR 
TITLE OF INVENTION: REJECTION ANTIGEN PRECURSOR MAGE -CI AND USES 
TITLE OF INVENTION: THEREOF 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Felfe & Lynch 

STREET: 805 Third Avenue 

CITY: New York City 

STATE: New York 

COUNTRY : USA 

ZIP: 10022 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.5 inch, 360 kb storage 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: PC-DOS 

SOFTWARE: WordPerfect 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/993 , 118 

FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER : US/08/845 , 528 

FILING DATE: April 25, 1997 
ATTORNEY/AGENT INFORMATION: 

NAME: Mary Anne Schofield 

REGISTRATION NUMBER: 36,669 

REFERENCE/DOCKET NUMBER: LUD 5455 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212) 688-9200 

TELEFAX: (212) 838-3884 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1142 

TYPE: amino acids 

STRANDEDNESS : s ingle- stranded 

TOPOLOGY: linear 
US-08-993-118-7 

Query Match 4.0%; Score 233; DB 2; Length 1142; 

Best Local Similarity 19.6%; Pred . No. 1.9e~06; 

Matches 221; Conservative 156; Mismatches 439; Indels 310; Gaps 48 



Qy 



8 SLVSSSTDSPPRPPPAFKYQFVTEPEDEEDE-EEEEDEEEDDEDLEELEVLERKPAAGLS 66 



Db 



13 SLLQSSSESPQSCP EGEDSQSPLQIPQSSPESDDTLYPLQSPQSRSEG 60 



Qy 67 AAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWE- -RSPAAPAPSLPPAAA 124 

= Ml = II :| |: | : :|| : | | 

Db 61 EDSSDPLQRPPEGKDSQSPLQI PQSSPEGDDTQSPLQNSQSSPEG- - 105 

Qy 125 VLPSKLPEDDEPPARPPPPPPAG---ASPLAEPAAP PSTPAAPKRRGSGS 171 

: I I Ml III Ih hi : : I 

Db 106 KDSLSPLEISQSPPEGEDVQSPLQNPASSFFSSALLSIFQSSPESIQSPFEG- 157 

Qy 172 VDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFP SVLLE 218 

I || :|: :: : : s | | || | || 

Db 158 FPQSVLQIPVSAASSSTLVSIFQSSPESTQSPFEGFPQSPLQIPVSRSFSSTLLS 212 

Qy 219 TAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLA 278 

I I I =: I : =h III II = III 
Db 213 IFQSSPERSQRTSEGFAQSPLQIPVSSSSSS TLLSLFQSS PERTQ 257 

Qy 279 EFSELEYSEMGSSFKGSPKGESAILV ENT KE E V I VR S KD KEDL VC S AALH S P Q E S P 334 

hhl h II :| : | :: || : | 

Db 258 STFEGFPQSPLQI PVSRSFSSTLLSI FQSSPERTQSTFEGFAQSPLQI P 306 

Qy 335 VGKEDRW SPEKTMDIFNEMQMSW-APVREEYAD- -FKPFEQAWE-VKDTYE 383 

I I 1 I E I h= h h : I : hi 

Db 307 VSSSSSSTLLSLFQSSPERTQSTFEGFPQSLLQIPMTSSFSSTLLSIFQSSPESAQSTFE 366 

Qy 3 84 GSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITC 443 

Db 367 G FPQSPLQI PGSPSFSSTL 385 

Qy 444 ASFTSATESTTANTF PLLEDHTSENKTDEKKIEERKAQI ITEKTSPKTSNPFLV 4 97 

I - I HI II II = I : = 11 = : = 

Db 386 LSLFQSSPERTHSTFEGFPQSPLQI PMTSSFSSTLLSILiQ SSPESAQSAFE 436 

Qy 4 98 AVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLV 557 

Db 43 7 GFPQSPLQIPVSSSFSYTLLSLFQSSPE-RTQSTFEGFPQSPLQIPVSSSSSSSTLLSLF 4 95 

Qy 558 QTSEAIQESLY PTAQLCPSFEEAEATPSPVLPDI VMEAP- -LNSL LPS 603 

hi =1 : h I I I I Ih II I HI I 

Db 4 96 QSSPECTQSTFEGFPQSPLQI PQSPPEGENTHSPL- -QI VPSLPEWEDSLSPHYFPQSPP 553 

Qy 604 AGASWQPSVSPLEAPP PVSYDSIKLE- -PENPPPYEEAMNVALKA 647 

I = I I -II I Ih |::|| h : h 

Db 554 QGEDSLSPHYFP-QSPPQGEDSLSPHYFPQSPQGEDSLSPHYFPQSPPQGEDSMSPLYFP 612 

Qy 648 LGTKEGI KEPESFNAAVQETEAPYI S I ACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPE 707 

:| | | :::| :|| | |: || :: | :: | 

Db 613 QSPLQG EEFQSSLQSP VSI-CS SSTPSSLPQSFPESSQSPPEGPV 656 

Qy 708 HAELVEDSSPESEPVDLFSD DSIPE VPQTQEEAVMLMKESLTEVS 752 

= I Ihl I : I :| II Hh I -Ih : 

Db 657 QSPL- - 7HSPQSPPEGMHSQSPLQSPESAPEGEDSLSPLQI PQSPLEG EDSLSSLH 709 

Qy 753 ETVAQHKEERLSASPQELGK- -PYLESFQPNLHS TKDAASNDI PTLTKKEKI S 803 

= = I I II = I Ml =1 I = I =| = I 

Db 710 FPQSPPEWED-SLSPLHFPQFPPQGEDFQSSLQSPVS1CSSSTSLSLPQSFPESPQSPPE 768 



Qy 804 LQMEEFNTAI YSNDDLLSSKEDKIKESETFSDSSPIEI - IDEFPTFVSAKDDSP 856 

II : h III:: II- : ||: h 

Db 769 GPAQSPLQRPVSSFFSYTLASLLQSSHESPQSPPEGPAQSPLQSPVSSFPSSTSS 823 

Qy 857 KLAKEYTDLEVSDKSEI ANI QSGADSLPCLELPCDLSFKNI YPKDEVH VSDEFSENR 913 

Db 824 SLSQSSPVSSFPSSTSS SLSKSSPESPLQSPVISFSSSTSL 864 

Qy 914 SSVSKAS I S PSNVSALEPQTEMGS I VKSKSLTKEAEKKLPSD TEKEDR 961 

I hill: I : --I III = = | = h III 

Db 865 SPFSEESSSPVD EYTSSSDTLLESDSLT-DSESLI ESEPLFTYTLDEKVDELARFL 919 

Qy 962 SLSAVLSAELSK-TS WDLLYWRDI KKTGWFGASL 996 

Db 920 LLKYQVKQP I TKAEMLTNVI SRYTGYFPVI FRKAREF I EI LFGI SL 965 



Search completed: December 19, 2003, 15:37:55 
Job time : 25 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



December 19, 2003, 15:36:49 ; Search time 40 Seconds 

(without alignments) 
5430.007 Million cell updates/sec 

US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP VKDAMAKIQAKI PGLKRKAD 1163 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



696363 seqs, 186758610 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



696363 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep:* 

3 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB .pep 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB .pep 

5 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep 

6 : /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep: 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB .pep : * 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

9 : /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep: 
10 : /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 
11 : /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep 
12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: 
13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep 
14 : /cgn2_6/~ptodata/l/pubpaa/US10B_PUBCOMB.pep 
15 : /cgn2_6/ptodata/l/pubpaa/US10C__PUBCOMB.pep 
16 : /cgn2__6/ptodata/l/pubpaa/US10__NEW__PUB.pep 
17 : /cgn2_6/ptodata/l/pubpaa/US60__NEW_PUB .pep 
18 : /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Sequence 3, Appli 


38 


267 


4 


6 


1781 


11 
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Sequence 13 , Appl 
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1325 
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US-09-864-761-35612 


Sequence 35612, A 
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259.5 
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4 


617 
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US-09-864-761-36182 ' 


Sequence 36182, A 


41 


259.5 


4 


4 


617 


12 


US-10-029-386-32114 


Sequence 32114, A 


42 


255.5 


4 


4 


1178 


15 


US-10-128-714-8240 


Sequence 8240, Ap 
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4 


5877 


15 


US-10-142-515-11 


Sequence 11, Appl 
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4 . 
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5935 
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US-10-243-243A-8 


Sequence 8, Appli 
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4 
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US-09-902-432-4 


Sequence 4, Appli 



ALIGNMENTS 



RESULT 1 

US-09-893-348-18 

; Sequence 18, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

; APPLICANT: EI SENBACH- SCHWARTZ , Michal 



APPLICANT: COHEN , I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T- CELLS , NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S -SCHWARTZ-2A 
CURRENT APPLICATION NUMBER: US/09/893 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 18 
LENGTH: 1163 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 

Query Match 100.0%; Score 5848; DB 9; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 5.7e-287; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I M I ! I M i I M I I I I I MM 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

QY 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

M I I I i 1 l I I I I ' Ml I I I I I I I I l:U 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

IIIIIIIIIMIIIIIMMIIIMIIIIIIIIIIIIMMIIIIIIIIMIilllllll 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

IIIIIIMIIMIIIMMIIIIIIIIIIIIIIIIIIMMMIIMIIIIIIIMIIII 

Db 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

Qy 241 GNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGS PKGES 3 00 

lllllllllllllllllllllillllllllllllllllllllllllllllllllllllll 

Db 241 GNLSAVSSSEGTI EETLNEASKELPERATNPFWRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 301 AILVENTKEEVI VRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

M I I ' I ! I i I I I ; .| I Mi | | | | | i ,| |,| 

Db 301 AILVENTKEEVI VRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Qy 361 PWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

I ! I I I ! I I i I I M I I I I I M M' III: 

Db 361 PWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 



Qy 


421 


NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 


480 


Db 


421 


Ml 1 1 : 1 M 1 ! 1 Ml 1 1 1 1 M I I I I h i , 

NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 


480 


Ov 


481 


QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 
1 I 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 ! M 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I I I I I 1 1 1 1 1 1 


540 


Db 


481 


III i Mill 1 l l l 1 M 1 1 | | | | | | | M M M M M M M 1 

QI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 54 0 


Ov 


541 


NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 


600 


Db 


541 


1 ' 1 1 1 I 1 1 1 1 ,U , MM 1 1 I 1 Ml 1 1 i 

NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 


600 


Ov 


601 


LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 


660 


Db 


601 


II MMI IMM! Illllllllll lllillllMII MIMIIIIIIIIMIIIIIM 1 

LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 


660 


Ov 


661 


NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 


720 


Db 


661 


IIIIIIM IMIMMIMM III Mill MIMMIMMMMII MIMIIMI III 

NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 


720 


Ov 


721 


PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIII 


780 


Db 


721 


PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 


780 


Ov 


781 


NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKI KESETFSDSSPIE 


840 


Db 


781 


IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIMIIIIIMII 

NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPIE 


840 


Ov 


841 


I IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 

MIIIIIMIIIIIIIIIIIMIMIIIIIIMIIMIIIMIIIIIIIIIIIIIIIIII 


900 


Db 


841 


I IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 


900 


Ov 


901 


DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 


960 


Db 


901 


I 

DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 


960 


Ov 


961 


RSLSA VLSAELSKTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 


1020 


Db 


961 


1 M II II II II II II II II II II 1 II 1 II M 1 M M M II M M 1 M M M M II II 1 II 

RSLSA VLSAELSKTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS 


1020 


Ov 
vy 


1021 


VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 


1080 


Db 


1021 


! 1 1 ' I i 1 1 ! i 1 Li ' IM 1 1 1 1 1 MM 1 1:1,1 1 

VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 


1080 


Qy 


1081 


FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALISLFS I PVI YERHQVQI DHYLGLA 


1140 


Db 


1081 


I 1 1 U , 1 1 !, Ml ; 1 Ml i 1 1 1 Ml 1 1 1 1 1 1 1 i 

FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 


1140 


Qy 


1141 


NKSVKDAMAKI QAKI PGLKRKAD 1163 




Db 


1141 


IIIIIIIMIIIIIMMIIMI 

NKSVKDAMAKI QAKI PGLKRKAD 1163 




RESULT 2 
US-09-758 


-140- 


-6 





; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1 



; GENERAL INFORMATION: 

; APPLICANT: Strittmatter , Stephen M. 

; TITLE OF INVENTION: No. US20020012965A1O Receptor-Mediated Blockade of 
Axonal Growth 

; FILE REFERENCE: 44574 -5073 -US 

; CURRENT APPLICATION NUMBER: US/ 09/758 , 14 0 

; CURRENT FILING DATE: 2001-01-12 

PRIOR APPLICATION NUMBER: US 60/175,707 
; PRIOR FILING DATE: 2000-01-12 

PRIOR APPLICATION NUMBER: US 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: US 60/236,378 

PRIOR FILING DATE: 2000-09-29 
; NUMBER OF SEQ ID NOS : 20 
; SOFTWARE: Patent In Ver . 2.1 
; SEQ ID NO 6 

LENGTH: 1192 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-758-140-6 



Query Match 75.3%; Score 4403.5; DB 9; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 4.7e-214; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll Mill MINI MINIM MUM I II I h I h = II II 1 1 1 1 II II I 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFWEPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I II III Mhll U Mllllllllllll II II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 



Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III MM I I i I I i i || || ; I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

I! Illllllllllllllllll 1 1 1 1 III MMIhhMMMMMMMMM 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill MUM I MlhM ! :l hi M .11 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 2 97 



Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KED 339 

IIIIMI Ml MhM I :||:||::||:|: || : lh II I III 

Db 2 98 SEMGSSFS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKED 357 

Qy 340 RWS PEKTMDI FNEMQMS WAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Ml M I Ml ::M I M II I I I M ! I I M 11111= I I hill = I = 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 



Qy 3 96 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

MMhM Mill : MM hi IMMM Ml I llllll I MMI 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 



Ov 


455 


ANTFPLLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 

1 MM 1 J hi II III MM MUM h 

TNI FPLLGDPTSENKTDEKKI EEKKAOT VTKKMT^TKT^MPRT.VAann^PTnv^rrTnMT ,T 

J- J. J. i. J — iJ— i-J j_ j. ijulmix. j. J—' -i—i a vj. v j. x v X Hi i\_L>l 1 u 1 IN. J. Ql'J It IT 1_j VfifiUUOIj X LJ X V X X LAIN J— 1 X 


513 


Db 


477 


536 


Ov 
wy 


514 


KVTEAAV^MMPEOTiTPDTiVOEAOF^FT.NRATOTK'T AYPTK\/nT .VnTQFa thpct vptaOT, 

MM hlllllllllllllllllllll lllllllllhlllllll 1 III 

KVTEEVVAMMPEnTiTPDTiVOEAPF c !FT,NF\/TnTK"T AYRTKMnT.VOTcrF^TMnRQT Vpa A.DT. 

IV v J- Hi Hi v V jrVLvl'llT IlivJU 1 IT X-/XJ V \^ ilir\ ^ Hi O I— i Hi V X V_J X xv± .rA. X Hi 1 IVlVlXJXj VyiiDlj VI'I^ILOXj X nr\r\\£±J 


D t ~J 


Db 


537 


J JD 




R 74 


LrDT IliIiLrt.IlLrt. X r Or VJjrUl V 1M Iji-i x XjX\J o XjXj n OrtVjLrtO V VyrO V O r 1 iTiHr rr VO l Uj 1 r\ 1 ir. r XjINJ 




Db 


597 


IIIIIMIMIIII HI Mill III Mlllllhlll Mill hhlll MM 

CPSFEE^EATP^PVTiPnT VTVIRAPT.N^AVP^AnA^VTOP^ciQPT.RA Q Q^TM VR Q T prut 1 p FN 

x o i. i_ 1 1— i l3 J— lta. J. it i_? XT vur XJ X V 11 HjLrt. it XJXM Oil V XT O^t-VJ-cAlJ V X \J Ir O O O r LiCin, oj V X\l X XjO X XXXiXli ■ |M 


ODD 


Ov 


634 


PPPYEEAMNVALKAL-nTKEC^IKEPESFMAAVOFTFAPYT APnT.TKFTKT.^TF'P^PnP 

x_ J- J- J- j — i i, .1 ixi 1-L>i v xiUluixj v_j x 1 N^X—J J_ ivLi XT i J jL l>lxiii. V v_J X_j X X-l*T^X X J_ x. rl^w J /X.I x, rv ri x XvXJ l_5 X " J XT O XT L-J C 


D Z> 


Db 


656 


MMMIhhll : I II llllh 1 1 :i M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 IMI 

PPPYFFAM^V^T .KKV^HT KRRT KRPRNTMAAT.OFTRAPYT <^J APDT.T WFTKT.c:a.RPA. PHF 
rrr i i_ji_LrtJ. 10 v j xjrvrv v o vjt x ivj_i j_j x ixXj it HjXN x lm -rtjri xj ^ Hi x llir\ir x x o X .rtA XjXj X xVili X r\ i i.^a p, iri-irr XJr 




wy 


693 


^NY^RT AKRRK^VPRHART.VRDQ^PR^RPVTYT .R^nTjQT PRVPOTOFRAX/MT MK'PQT TR\7Q 

OXN X O XLi X ralvr Cj IVO V IT HirXrtIliXj v XjXJOO r Ij OJ-ir v XvXjT O XJXJO X rCi V x v V;X-iL1lH. Vl\lXJl v i]vJioXj X 111 V o 

hllhll h IhhIllllllhllllllllMllhlll hi IIMIMI 1 


n c; o 




7 1 


QnY £ ^RMAK'\7'ROP\7PnHQRT VRnQQ pnQRP\7PiT PCnnC T PrtVPni^AFiPT'TTlVIT \7l^T?CT TPTC 
OXJ X OXj1 v XHXYV Lj^ Jr V i-'LJrlO XjXj V CjXJoo X^XJocjX^ V XJljr OJJJJO 1 jtIJ V X^yiX^JJij 1 VlYllj V JXIIjOXj 1 Hj 1 o 


/ /D 


Ov 
^y 


753 


- FTVAOHK-RFPT.Cia CJPOFT.n'K'PVT .RCIROPKTT ,T4QTT< r 'nA - A CMriT PTT ^WWT CT fiWIPPP 

Dl Vn^niX CjIlilxXJOiT.0 i V^IliXJOlVlr XXjX_jOF ly/r^XNlXJllO 1 IvX>jHl rt.n 1M 1 J 1 r 1 III f\ Pl rt r\ I ,71 1 '^1*^ "> r* r 


q n q 


Db 


776 


1^ 1 1 | | . III :| :||| ::: hi 1 1. 

RRQMT RYRMKRKT.C! AT iPPROnKPYT .R^RKT .QT .nKTTKnTT J .PDRV^TT <ZWT?\<T PT HMRRT 


C3 -3 o 


Ov 


OIU 


XM Xi-lX X OXNX^XJXjXJOOrvCjXJiVX xVILoXIj 1 r OJJDO Jt X CjI 1 Iv-CiT rlr Vo/\l\XJJJo ~ K K 1 iAKK I 1 JJXjXj vb 


O DO 


Db 


836 


= 11111111 111 1 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 1 E If II II hi Ml MM 

STAVYSNDDTiRT ^KRAOTRRTFTF^D^^PT FT TDFPPTTJc;^KTnc;pc;RT.aRPYTnT;Wc; 


0 _7 D 


Ov 

wy 


86 9 


HKCJRT AMTOQOADClT.PPT.PT.Pr'riT.QRK'MT YP^ DR^7TT\7QTlFFC!RMPQC;\7'C i^a C T CPCMV 

UIVOCiXruM X K^J i_> Vji-iXJ O XJ ir V_Xj£LXj It V„XJXJiDX' f\XM X I r A. XJXIjVliVOXJXjr OCilMKoo VOlVHO X O lr OXM V 


Q 0 ^ 


Db 


8 96 


llllll II MM Ml III III II ::: Ihlhl h II = 1 M 

HK^FTAWAPri^An^T.prTFT.PwnT.QT.i^MTnp^wffKTQPQnnycii^M t t ppn^T" 

uiXDD X rtXNIrt. r U\j2~\\j o Xj lr ^ x XjXj Jr nXJXjOXjiXXM X J\v HiJ_ii\X j r O XJXJ Ir o i\i\J o iZJ/t. X O A.V XjXjXj ir tr LJ V 


Q cr 


Ov 
wy 


927 


^AT.RPOTRMH^T VK^K^T.TKFAFKK'T.P^riTRK'FnPQT.CIAATT.QART QT^TQ\A7TiT T VWPnTTf 

OjtIXjXj it ^ X CjI'IUO X V IVO jaOXj X r^XajrlXj J\l\XjrO U X Xli ivlli LJ ij Xj Ort V XjOJt.X-iXjO l\l jv V XJXjXj X HKUl J\ 


_? O D 


Db 


956 


Ml 1 h MM 1 1 MM llllll lllllll Ih M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

^AT.ATOART R^T VKPKW.VKFAFKKT.P^nTFK'RnR crpca t pCflpT.c^TCliATnT T VWPHT Y 


1U1D 


Ov 
wy 


9R 7 


KTn\A/RnA^T.RT .T J .^T .TVF^ T VC!\7TA VT ZiT.ZlT.T.Q'\7 T rT CT7R T VKHATT Tn^QnPHUPPP A 
iv x vj v v X* urioxir xjxjxjoxj x vrox vov l±\ i xrtX-iM.i_ixjo v ii or ri x xvvj v x X v^r^iDXJxjorl ir r i\J\ 


X U*± D 


Db 


1016 


IIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 


1075 


Qy 


1047 


YLESEVAI SEE L VQ KY S N S ALGH VNS T I KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 


Db 


1076 


IIMIMMIII MM 1 IIMII II II MINI II 1 Mill IIIIIIIIIMIMI 1 

YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 


Qy 


1107 


NGLTLLI LAL ISLFSI PVI YERHQVQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

llllll llllllllhil IMMI llllll MM h IIIIIIIMII lllllllh 


Db 


1136 


NGLTLLIlxALISLFSVPVIYERHQAQIDHYLGLANI<x\JVI<PAMAKIQAKIP 1192 



RESULT 3 

US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STR I TTMATTER , STEPHEN IVL 

TITLE OF INVENTION: NOGO - RECEPTOR -MEDIATED BLOCKADE OF AXONAL GROWTH 
FILE REFERENCE: C077 CIP US 



; CURRENT APPLICATION NUMBER: US/09/972 , 599A 
; CURRENT FILING DATE: 2001-10-06 

PRIOR APPLICATION NUMBER: PCT/USOl/01041 

PRIOR FILING DATE: 2001-01-12 

PRIOR APPLICATION NUMBER: 09/758,140 

PRIOR FILING DATE: 2001-01-12 

PRIOR APPLICATION NUMBER: 60/236,378 

PRIOR FILING DATE: 2000-09-29 

PRIOR APPLICATION NUMBER: 60/207,366 

PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 

SOFTWARE: Patent In Ver . 2.1 
; SEQ ID NO 6 

LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-972-599A-6 

Query Match 75.3%; Score 44 03.5; DB 9; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 4.7e-214; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Mhlll Mill MINI MINIM MUM 1 1 1 1 h I h M I II 1 1 1 1 1 ! 1 1 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

i I I I I f I I M Ml llhll : I MIMMIMIMI llllllll: || : | 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III I I I I I 1 I I I ' I ! M II lllllllllllll 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 22 5 

M I M 1 1 M M I M 1 1 1 1 1 1 1 MM III 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill IMM MUM I :|lh:| = = 11111= hi = = IM IMIMI 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 297 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG -KED 33 9 

lllllll Ml lll = = l I =II = II = = MM: II = 11= II I Ml 
Db 298 SEMGSSFSVSPKAESAVI VANPREEI IVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Ml II I III I IhlMlllllllh llllh I I hill =h 

Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

QY 3 96 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

MMhll Mill = MM hi lllllll =11 I llllll I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 



I 1 1 1 1 I MMMMMIMMMIMM II MINIM MM MUM I 

Db 4 77 TNIFPLLGDPTSENKTDEKKIEEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM MM IIMIMI IIIIIMI I II II 1 1 II I M 1 1 ! II I MMM Ml 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

QY 574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

IMMhlMMMMMMMMM MMMIhlll Mill Mhlll MM . 

Db 597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 

Qy 634 PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 692 

MMMIhhll : I M 1 IIMIIIMIIIIMIIIIIIMM I Ml II 

Db 656 PPPYEEAMSVSLKKVSGI KEEI KEPENINAALQETEAPYISIACDLI KETKLSAEPAPDF 715 

Qy 6 93 SNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS 752 

Nihil h ii I Ihl I I ! : ! hi llhllllll I 

Db 716 SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETS 775 

Qy 753 -ETVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKISLQMEEF 809 

l = = MMM I I IIIIIMM M MM = = = IIMIMI Mill 

Db 776 FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 835 

Qy 810 NTAI YSNDDLLSSKEDKI KESETFSDSSPI EI I DEFPTFVSAKDDS - PKIAKEYTDLEVS 868 

MIMIIMI Ml MMMMMMIMMIMM MM II MIMMIMM 

Db 836 STAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVS 895 

Qy 869 DKSEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNV 926 

MMM II MM III III III II IMMM MM : I M 

Db 8 96 HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 955 

Qy 927 SALEPQTEMGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDI K 986 

Ml I M MM I I MMIMMIMIMM IM MMMMMMMIMM 

Db 956 SALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIK 1015 

Qy 987 KTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVIQAI QKSDEGHPFRA 1046 

MMMMMMMMMMMMMMMMMMMMMMIMMMMMIMM 

Db 1016 KTGWFGASLFLLLSLTVFS I VSVTAYIALALLSVTI SFRI YKGVI QAI QKSDEGHPFRA 1075 

Qy 1047 YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1106 

I M II I I II II II M II I I I I I I II II I I II I II I II II I I I II II II I II II II I I II 
Db 1076 YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1135 

Qy 1107 NGLTLL I LAL I SLFS I PVI YERHQVQI DH YLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

1 1 1 r r 1 1 1 1 1 1 f 1 1 1 = j f 1 1 1 1 1 1 iiiiiiiiiihiiiiiiiiiiiiiiiiiih 

Db 1136 NGLTLL I LAL I SLFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 4 

US-10-060-036-71 

; Sequence 71, Application US/10060036 

; Publication No. US20030073144A1 

; GENERAL INFORMATION: 

; APPLICANT: Benson, Darin R. 

; APPLICANT: Kalos, Michael D. 

; APPLICANT: Lodes, Michael J. 

; APPLICANT: Persing, David H. 



; APPLICANT: Hepler, William T. 
; APPLICANT: Jiang, Yuqiu 

; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
; TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
; FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/lO/06 0 , 03 6 
; CURRENT FILING DATE: 2002-01-30 
; NUMBER OF SEQ ID NOS : 4560 

SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 71 

LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-060-036-71 



Query Match 75.3%; Score 4403.5; DB 15; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 4.7e-214; 

Matches 909; Conservative 104; Mismatches 145; Indels 39; Gaps 20; 



Qy 


l 


MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 


60 


Db 


l 


MMIM Mill IMIII IMIIMI IMIII MMhlhMMMMMMM 

MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 


58 


Qy 


61 


PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 

IIIIIIII II III Mhll : 1 MMMIMIMM lllllllh || : | 


115 


Db 


59 


PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 


118 


Qy 


116 


APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE - - PAAPPSTPAAPKR 


166 


Db 


119 


Ml MM IIIIIIMIIIIIIIIIIII II II MMMMIMM 

APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 


178 


Qy 


167 


RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 

1 1 1 1:1 i hi MM III MMIhhIIIIIMMIIIIIIIII 


225 


Db 


179 


RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 


237 


Qy 


226 


LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 


285 


Db 


238 


Mill Mill IMIII 1 :||h:| -Mllh hi "III Illllll 

LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 


297 


Qy 


286 


SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG -KED 
Illllll III 1 1 1 . . 1 1 . II . II . . ! t . 1 II . II. II 1 III 


339 


Db 


298 


1 Ml**l 1 • 1 1 ■ M • ■ M • 1 * II • II- 1 i 1 III 

SEMGSSFSVSPKAESAVIVANPREEI IVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 3 57 


Qy 


34 0 


RWSPEKTMDIFNEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 


395 


Db 


358 


Ml II 1 III :::| IhlMMIMIIh 11111= 1 1 hill :|: 
EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS -DMLAAGGKI ESNL 


416 


Qy 


396 


ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 


454 


Db 


417 


M II h II IMM : MM Ml IMMM Ml 1 IMIII 1 Mill 

ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 


476 


Qy 


455 


ANTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 
1 1 1 1 1 1 I 1 1 1 1 1 1 1 M 1 1 1 - 1 1 1 1 . 1 1 1 II 1 1 1 1 i II . 


513 


Db 


477 


I I I I I I I I M l - l l l l • l l l || lllllllll 1 1 1 1 1 1 1 1 If | : 

TNIFPLLGDPTSENKTDEKKIEEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 


Qy 


514 


KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 



Ml MM I Mill II III III III II Mill 1 1 Mhll Mill MM III Ml 



Db 


537 


KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 


-> Z> D 


Ov 


574 


C P S F E EA EA T P S P VL P D I VM F A PT .N ^ T J ■ P ^ AH A C I\ A7D P P T.FAPPPVQVnciTKT.FPFTvT 


£TT 
D Jj 


Db 


597 


MlllhMIIIIIIIIIIMMIM -Mil HIM hi MM 

CP^FEE^EATP^PVLPDT VMFAPT.N^JAVP^AflACJVT OPQQQPT pa QQ^/MVPQTK'RPPPM 


O D D 


Ov 


634 


PPPYEEAMNVALKAL-GTKEGI KEPESFNAAVOETEAPYT APnT.TKFTKT.ciTFPCipnp 


D j_ 


Db 


656 


MINI : 1 II Mill: II hi II MM MUM II Ml Ml 1 1 

PPPYEEAMSVSLKKVSGI KEEI KEPENTNAALOETEAPYT ACDT .T VWTKJ .^AWV>A pnp 


/ ± O 


Qy 


693 


SNYSEI AKFE KS VPEHAELVEDS S PES E P VDLFSDDS I P EVPQTQEEAVMLMKESLTEVS 


752 


Db 


716 


Nihil h 1 1 I.I 1 I 1 M 1 hi hi I'll I 

SDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETS 


775 


Ov 


753 


- ETVAQHK- EERLSAS PQELGKPYLESFQPNLHSTKDA- ASNDI PTLTKKEKI SLQMEEF 
1 1 . 1 1 1 1 1 1 1 1 I I I I I . .1 .ill ... M 


809 


Db 


776 


I 1 * * " 1 * 1 1 1 * * • II : 1 II II 1 1 II 1 

FESMIEYENKEKIjSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEEL 835 


Qy 


810 


NTAI YSNDDLLSSKEDKI KESETFSDSS P I EI I DEFPTFVSAKDDS - PKLAKEYTDLEVS 


868 


Db 


836 


IHI'll III :h|:||| Mill MINIMI MM || MhMIMIII 
STAVYSNDDLFISKEAQIRETETFSDSSPIEI IDEFPTLISSKTDSFSKLAREYTDLEVS 


895 


Qv 


869 


DKSEIANIQSGADSLPCLELPCDLSFKNI YPK- -DEVHVSDEFSENRSSVSKASISPSNV 


926 


Db 


896 


MUM II MM Ml III III M Ihlhl MM : 1 : | 
HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 


955 


Ov 


927 


SALEPQTEMGSI VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIK 


986 


Db 


956 


Ml 1 h MM 1 1 MMMIMMIIMM Ih IIIIMIIIIIIIMIMI 

S ALATQAE I ES I VKP KVLVKEAE KKL PSDTEKEDRS PS A 1 FSAELS KTS WDLLYWRD I K 


1015 


Qy 


987 


KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 


1046 


Db 


1016 


1 M M M II 1 1 1 M M 1 1 1 II 1 M M 1 II II II II M II 1 II 1 1 1 1 1 M M 1 1 II 1 II II 

KTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRA 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 


Db 


1076 


M II II M 1 M M 1 II 1 M II II II 1 II II II M II II II II 1 1 1 1 1 II 1 1 1 II 1 II II 

YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 


Qy 


1107 


NGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 
M II 1 M 1 1 II M 1 h 1 II II II 1 


Db 


1136 


NGLTLLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 5 
US-09-789-386-2 

; Sequence 2, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

TITLE OF INVENTION: NOVEL COMPOUNDS 

FILE REFERENCE: GP-30165-C1 
; CURRENT APPLICATION NUMBER: US/09/789 , 386 

CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 

PRIOR FILING DATE: 1999-07-19 

PRIOR APPLICATION NUMBER: U.K. 9816024.5 



PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 2 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-2 

Query Match 75.2%; Score 4398.5; DB 9; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 8.4e-214; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Mhlll Mill llllll MINIM MIMI 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I I I ! I 1 I I M III I ! I = f I M MIMIMIIMM I I f I I I I I : M : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Ml MM MIMIMMMMIIMM II II MIMMIIMM 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

II lllllllllllllllllll 1 1 1 1 III MlllhhIIIIIIIIMIIIIIIII 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 237 

Qy 22 6 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill llllll I I ■ ! I ■ hi :MM I'M 

Db 23 8 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 2 97 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVI WSKDKED-LVCSAALHSPQESPVG KED 339 

IMIMI III MhM I : I I : I I : : I I : I : || : lh II I 
Db 2 98 SEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 395 

Ml M I Ml = = = 1 I I : t I I I I I I I II t ^ Mllh I I hill : I : 

Db 358 EWSSEKAIODSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 

Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

I i I II Mill : 1 1 1 1 hi IMIMI Ml llllll I Mill 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKIEERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I IIIIIIIIIMIhMlhlll II 1 1 J 1 1 J I J I I II I llllll h 

Db 477 TNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

I I I I hMMMMMMIMMIMI I =111111 Ml 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 

Qy 574 CPSFEEAEATPSPVLPDI VMEAPLNSLLPSAGASVVQPSVSPLEAPPPVSYDSIKLEPEN 633 

MlllhlllilMIIIIIIIIIIII MIMIIhlll Mill hhlll MM 



Db 


597 


CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSWYESIKHEPEN 


O J Z) 


Qy 


634 


PPPYEEAMNVALKAL-GTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDF 


692 


Db 


656 


Ml' : | || HIM: l-l hi 1 1 I I'' ,| ||:||| 

PPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKIjSAEPAPDF 


715 


Qy 


693 


snyseiakfeksvpehaelvedsspesepvdlfsddsipevpotoffavmtmkf^t.tfv^ 




Db 


716 


Nihil h 1 1 1 1 1 I 1 1 1 I ! 1 hi llhllllll I 

SDYSEMAKVEOPVPDHSELVEDS^PD^FPVDTtF^DD^T PnVPnicnnT?T^/MT vt^tpqt TPTQ 


I/O 


Ov 


753 


- ETVAOHK- EERLSAS POELGKPYLESFOPNT iH^TKDA - A QNTi T PTT .TW^VJ OMPPP 


C5 U ^ 


Db 


776 


1- - = Mill 1 1 lllllllh H MM : = : lhl|||| ||||| 

FESMI EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEEL 


ft 7 R 


Qy 


810 


NTAI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDS-PKLAKEYTDLEVS 


ft £ft 
O D O 


Db 


836 


MMIIII III MNIIMIMIIIIIII 1 II 1 ■ 1 1 1 

STAVYSNDDLFI SKEAOIRETETFSD^PI EI IDEFPTT.T ^KTnQF^KT.APFVTTYT.'FVc: 


Q qc: 


Ov 


869 


DKSEIANIOSGADSLPCTiRTiPPnTi^FKMT YPK nFVTTV^nFPQFKTP QQVQfcra QT cdcmu 




Db 


896 


Mill II Ml III III III II Ihlhl h II M : 1 

HKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV 


jjj 


Qy 


927 


SALEPOTEMGS I VKSKSLTKEAEKKLPSDTEKEDRST A VT AFT .^KT^ WT)T ,T .YWRHTK 




Db 


956 


Ill 1 h Ml 1 1 MIMMIMM Ih III WWW IMMI II 

S ALATOAE I E S I VKP KVL VKEAEKKLP S DT E KE DR ^ P ^ A T F ^ A FT , c; K"T c: \ n/HT J , YT/d p n T V 


i ni c; 

IUIj 


Qy 


987 


KTG WFGASLFLLLS LT VFS I VS VTA Y I ALALLSVT I SFR I YKG VI QAIQKSDEGHPFRA 

IIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 




Db 


1016 


KTG WFGASLFLLLSLTVFS I VS VTA Y I ALALLSVT I SFR I YKG VI QAIQKSDEGHPFRA 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 

IMIIMIIIIMIIIMIIIIIII 1 


1106 


Db 


1076 


YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 


Qy 


1107 


NGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

IIIIIIMMIIIIhMIIIIII , 


Db 


1136 


NGLTLL I LALI SLFSVPVI YERHQAQ I DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 


RESULT 
US-09-£ 


6 

J93-348- 


-23 





; Sequence 23, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

; APPLICANT: E I SENBACH- SCHWARTZ , Michal 
; APPLICANT: COHEN, I run R. 
; APPLICANT: BESERMAN, Pierre 
; APPLICANT: M0S0NEG0, Alon 
; APPLICANT: MOALEM , Gila 

; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S -SCHWARTZ =2 A 
; CURRENT APPLICATION NUMBER: US/09/893 , 348 
; CURRENT FILING DATE: 2001-06-28 

PRIOR APPLICATION NUMBER: US 09/314,161 
; PRIOR FILING DATE: 1999-05-19 

PRIOR APPLICATION NUMBER: US 09/218,277 



; PRIOR FILING DATE : 1998-12-22 

; PRIOR APPLICATION NUMBER: PCT/US98 / 14715 

; PRIOR FILING DATE: 1998-07-21 

; PRIOR APPLICATION NUMBER: IL 124500 

; PRIOR FILING DATE: 1998-05-19 

; NUMBER OF SEQ ID NOS : 2 9 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 23 

LENGTH: 1192 

TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-23 



Query Match 75.2%; Score 4398-5; DB 9; Length 1192; 

Best Local Similarity 75.9%; Pred. No. 8.4e-214; 

Matches 908; Conservative 104; Mismatches 146; Indels 39; Gaps 20; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Mhlll Mill 1 1 1 1 1 1 IMIIIII MINI , 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

IIIMIII II III Mhll = 1 IIIIIIIIIMIM IIIMMh II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III 1 1 1 1 I I ! I l I I M II MIMIIMIIII 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 



Qy 167 RG-SGSVDETLFALPAASEPVT PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 225 

M lllllllllllllllllll I I I I III 

Db 17 9 RGSSGSVDETLFALPAASEPVI RSSAEN-MDLKEQPGNTI SAGQEDFPSVLLETAASLPS 237 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Mill Mill llllll I : I I I : : I ==11111= hi = = I I I lllllll 

Db 238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 2 97 



Qy 286 SEMGSSFKGS PKGESAI LVENTKEEVI VRSKDKED -LVCSAALHS PQESPVG KED 339 

lllllll III I I I ^ : I I : I I : I I : = I I : I : II = lh || | II 
Db 298 SEMGSSFSVS PKAESAVI VANPREE 1 1 VKNKDEEEKLVSNNI LHNQQELPTALTKLVKED 357 

Qy 340 RWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDV1AA RANV 395 

Ml M I III ::M IhlMIMIMM: 11111= I I hill 
Db 358 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNL 416 



Qy 396 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 454 

II HIM : | | | | |:| IIIIIM :|| \\\\\\ | :|||| 

Db 417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIA 476 

Qy 455 ANTFPLLEDHTSENKTDEKKI EERKAQI ITEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

i 1 1 1 1 i iiiiiiiiiiiihiiihiii ii iiiiiiiii 1 1 1 1 mill h 

Db 477 TNI FPLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYVTTDNLT 536 

Qy 514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

MM hllllllllllllllllMIII IMIMIIIhlllllM =111111 III 

Db 537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 



Ov 


574 


CPSFEEAEATPSPVLPDT VMFA PT .M^T.T.P^ AHA QWOP^VQ dt Fa dddvc: vnc t vi pp™ 
III IIMIII lllilll MIMil :|||||lhlll Mill |:|:||| MM 


C 11 

D JJ 


Db 


597 


CPSFEESEATPSPVLPDI VM RAPT .N^AVP^AflA^VT Dpcj^cjPT.FA _QQ \7T\TVFQTTaJFPFT\I 


(Tec; 
ODD 


Qy 


634 


PPPYEFAMNVALKAL-GTKEGIKEPESFNAAVOETEAPYT CJT ACTYIMTK'FTK'T.QTFPQPnF 




Db 


656 


III : | M llllh IMMIIIMMMIMMIIMM III 

PPPYEEAMSVSLKKVSGIKEEIKEPENTNAAT.OFTFAPYT arm t^ftttt Qappapnp 


n i ^ 

l ±D 


Ov 


693 


SNYSEIAKFEKSVPERAELVED^^PK^FPVnT.F^nnciT PRVPnTnPT?a^/iviT m^pct t^/q 


/ OA 


Db 


716 


1-111111= IMMIIMM Mill MUM Mlh II hi IMMMIII 1 

SDYSEMAKVEOPVPnH^FTiVFD^QPn^FPVTjT .F^nn^T Pn\^POT<rnnFTT7MT \twwct t't^t'C 


H H C 


Ov 


753 


-ETVAOHK-EERLSASPOFT.nKPYTiF^FnPNT.H^TK'riA - AQKinT PTT w^PlfT qt hmppt? 


buy 


Db 


776 


1- MM II 1 1 III llllh M MM ::: ||:||||| ||||| 

FESMI EYEMKEKT.^ATiPPFfiriKPVT.FCIFPCT.QT.'nMTk'nTT.T DnFVQTT QKKT?vt dt nwrfCT 

*• XjlJI 1 X Xj X Xji.NI I VI_J IVXJOii.XJ X X LjUU IV X X XJXjOX* IN 1 iJXjXJlVI X I\XJ 1 XJXJX Uili Vjl XjO lYIVLj IV X r 1 il J|V| P. P. I j 


Old 
OJJ 


Qv 


810 


NTAI YSNDDLLSSKEDKI KE^ETF^D^ciPTFT TDFFPTFVCJ AKT1T1Q - PkT AT/FVTnT WQ 


o bo 


Db 


836 


MIMMIM III MMMIMIIMM MM III MM II IMMMMIM 

STAVYSNDDLFI ^KEAOT RRTFTF^D^^PT FT TTjFFPTT.T Q Q TfTn Q F Q ft ADDVTnT p\7C 


o o c 


Ov 


869 


DKSEIANTO^nAn c ^TiPrTiET 1 PrDTi c JFK'MT YPK - - -TiF\/TT\7^riFFQFKTPcrc\7Ci^A ctcdcmu 


y z b 


Db 


896 


MIMI II MM III III III M ::: : |: || : | : | 

HKSEIANAPDGAGSLPCTELPHDLSLKNTOPKVFFKT ^F^nnF^KTJn^AT^^TT T T ppnu 


QCi: 


Ov 


927 


SALEPOTEMG^ T VK^K^T.TKFAFKKT.PCinTFKFriP QT c a\tt capt C^tcwttit t vtaTDT^iT v 
i - lrtiJJ - ir V x j-iiMV-JO J- v rvo ivoxj x rvrj^Jzj JA.xVUr^oXJ X XliJXCiLJKoXjo/lVXjoArjXjo A.X o V VxJXjXj x WKXJX rv 


Q O (Z 

yob 


Db 


956 


Ml 1 h MM 1 1 1 1 1 1 1 1 h II 1 1 1 1 1 1 1 II 1 I I I I || | 

SALxATOAET E^ T VKPK\7T ■VKFAFK'K'T.PQ'nTFK'FnP QPQA T FQA FT QWTCTAmT T vtaTDT^t y 
iji-ixjzT. x v^^-ixj iLiji v ivr v xj v ivxiLrtiiir^i\_LJir ou x rjrvljXJr\.o x D/t.X V oi-ixjXjo fvX 0 V V XJXjXj x WKU1 a. 


1 m r 


Qy 


987 


KTGWFGASLFLLL^TiTVF^T V^VTAYT AT.AT.T.C!\7TT CJPR T VKTIVT DQ Tfi^QnpfHPCD a 

v — ' v v x xji. xj 1 1 1 Il3u x v l lji vovx jm x X iri.Xxc-i.Xj xj 0 V 1 1 Drill X i\Xj V X I^Hl \J 1 J r A -r H X - ' f KJ-v 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIII 


1 C\ A C 

X(J4b 


Db 


1016 


KTG WFGAS L F LLLS LTVF S I VS VTA Y I ALALLS VT I S FR I Y KG V I QA I QKS D EGH P FRA 


1075 


Qy 


1047 


YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 


Db 


1076 


lllllllllllllllllllllllll 

YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 


Qy 


1107 


NGLTLL I LAL I SLFSI PVI YERHQVQI DHYLGIjANKSVKDAMAKI QAKI PGLKRKAD 1163 


Db 


1136 


i i i i i i m i i i i : ;i i i i n 1 1 

NGLTLL I LAL I SLFS VPVI YERHQAQI DH YLGLANKN VKDAMAKI QAKI PGLKRKAE 1192 



RESULT 7 

US-10-205-194-164 

Sequence 164, Application US/10205194 
Publication No. US20030134301A1 
GENERAL INFORMATION: 
APPLICANT: Warner-Lambert Company 
APPLICANT: Lee, Kevin 
APPLICANT: Dixon, Alistair 
APPLICANT: Brooksbank, Robert 
APPLICANT: Pinnock, Robert 

TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A- 0182 01 

CURRENT APPLICATION NUMBER: US/10/205 , 194 
CURRENT FILING DATE: 5200-07-24 
PRIOR APPLICATION NUMBER: GB 0118354.0 
PRIOR FILING DATE: 2001-07-27 



NUMBER OF SEQ ID NOS : 177 
SOFTWARE: Patent In Ver. 2.1 
SEQ ID NO 164 
LENGTH: 37 9 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
FEATURE : 

OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-164 

Query Match 25.9%; Score 1513; DB 12; Length 379; 

Best Local Similarity 32.6%; Pred. No. 8.1e-69; 

Matches 379; Conservative 0; Mismatches 0; Indels 784; Gaps 1 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

IIMIIIIIIIIMIIilllllMIMIIIIIIMIIIIIIIIIIIIMMIMIIIIII 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

IMIMMIIIIMIIIIIIIIIIIIMIIIIIIIIMIMIIIIIIIIIIIIIIIIIM 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

IIIIIIIIIMIIIMMMMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

MINIM 

Db 181 AASEPVI PSSA 191 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 
Db 192 191 

QY 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 
Db 192 , i 91 

Qy 3 61 PWEEYADFKPFEQAWEVKDTYEGSRDVTiAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 
Db 192 191 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 48 0 
Db 192 191 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 54 0 
Db 192 19! 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 
Db 192 191 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 



Qy 



661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 



Db 


192 


Qy 


721 


Db 


192 


Qy 


781 


Db 


192 


Qy 


841 


Db 


192 


Qy 


901 


Db 


192 


Qy 


961 


Db 


192 


Qy 


1021 


Db 


237 


Qy 


1081 


Db 


297 


Qy 


1141 


Db 


357 



191 



igi 

781 NLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI E 84 0 
191 

841 IIDEFPTFVSAKDDSPK1AKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

___ 191 



191 



MIMIIIIIIIIIIIIIMIMMIIMIIIIIIIIIMUIII 

WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS 

VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 

I 

WISFRIYKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 
FLVDDLVDS LKFAVLMW VFT Y VGALFNGLTLL I LAL I SLFS I PVI YERHQVQ I DHYLGLA 

. . : i 

FLVDDLVDS LKFAVLMW VFT Y VGALFNGLTLL I LAL I SLFS I PVI YERHQVQI DHYLGLA 

NKS VKDAMAKI QAKI PGLKRKAD 1163 
lllllllllllllllllllllll 



RESULT 8 

US-09-893-348-20 

Sequence 20, Application US/09893348 
Patent No. US200200724 93A1 
GENERAL INFORMATION: 
APPLICANT : EI SENBACH- SCHWARTZ , Michal 
COHEN, I run R. . 
BESERMAN , Pierre 
MOSONEGO, Alon 
MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S -SCHWARTZ- 2 A 
CURRENT APPLICATION NUMBER: US/09/8 93,348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



; PRIOR APPLICATION NUMBER: PCT/US98/ 147 15 

; PRIOR FILING DATE: 1998-07-21 

; PRIOR APPLICATION NUMBER: IL 124500 

; PRIOR FILING DATE: 1998-05-19 

; NUMBER OF SEQ ID NOS : 29 

SOFTWARE: Patent In version 3 . 1 
; SEQ ID NO 2 0 

LENGTH: 3 60 

TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-20 

Query Match 24.1%; Score 1411.5; DB 9; Length 360; 

Best Local Similarity 31.0%; Pred. No. le-63; 

Matches 360; Conservative 0; Mismatches 0; Indels 8 03; Gaps 1; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

IIIIIIIIIMIIIIMIIIIIIIIIMMIIIIIIIIIIIMIIIIIIIIMIIIIIII 

Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

M 1 1 Ml 1 1 1 1 II 1 1 1 1 1 Ml 1 1 Ml 1 1 1 Ml 1 1 M 1 1 1 1 MM 1 1 1 1 M 1 1 1 Ml II 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 



Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSV 172 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 173 172 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 3 00 

Db 173 172 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Db 173 172 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420 

Db 173 172 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Db 173 172 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Db 173 172 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 173 172 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 



Qy 



661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 72 0 



Db 173 172 

Qy 721 PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 173 172 

Qy 781 NLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI E 84 0 

Db 173 172 

Qy 841 1 1 DEFPTFVSAKDDS PKLAKEYTDLEVSDKSEI ANI QSGADSLPCLELPCDLSFKNI YPK 900 

Db 173 172 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 173 172 



Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 1020 

1 1 1 II M M I MM 1 1 1 1 1 1 Ml 1 1 1 1 M Ml II I II II I 1 1 M 

Db 173 VVDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLS 217 

Qy 1021 VT I SFR I YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 1080 

Db 218 VTISFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 277 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

■ , . , Ml 

Db 278 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 337 

Qy 1141 NKSVKDAMAKIQAKI PGLKRKAD 1163 

I Ml II Ml II I Ml I M MM 

Db 338 NKSVKDAMAKIQAKI PGLKRKAD 360 



RESULT 9 
US-09-789-386-6 

; Sequence 6, Application US/09789386 

; Patent No: US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/ 09/789 , 386 
; CURRENT FILING DATE: 2001-02-21 
; PRIOR APPLICATION NUMBER: U.K. 9916898.1 
; PRIOR FILING DATE: 1999-07-19 

PRIOR APPLICATION NUMBER: U.K. 9816024.5 

PRIOR FILING DATE: 1998-07-22 

PRIOR APPLICATION NUMBER : US 09/359,208 
; PRIOR FILING DATE: 1999-07-22 



NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 6 
LENGTH: 373 
TYPE : PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 

Query Match 20.4%; Score 1191; DB 9; Length 373; 

Best Local Similarity 27.8%; Pred. No. 1.4e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7, 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

1 Mill MINI MINIM MUM 1 1 II h I h M 1 1 1 M 1 1 1 1 1 1 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I M M I M II III I I I " I I : I I I II II II I II II I IMIMIh I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III 1 1 1 1 I INI! MINIM Mlil II II lllllllllllll 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 6 
III 

Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGS S FKGS PKGESAI LVENTKEEVI VRSKDKEDLVCSAALHS PQES P VGKEDR WS PEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED 4 06 

Db 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS 466 

Db 183 182 

Qy 467 ENKTDEKKI EERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526 

Db 183 182 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586 

Db 183 182 

Qy 58 7 VLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 64 6 

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 



Db 


183 


Qy 


707 


Db 


183 


Qy 


767 


Db 


183 


Qy 


827 


Db 


183 


Qy 


887 


Db 


183 


Qy 


947 


Db 


186 


Qy 


1007 


Db 


217 


Qy 


1067 


Db 


277 


Qy 


1127 


Db 


337 



182 



182 



182 



182 



-GSV 185 



IIMIIIIIIIIIIIIIIIIIIMIIIIIII 
WDLLYWRDI KKTGVVFGASLFLLLSLTVFS 

VS VTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSA 

IMMIIIIIIIIIilMIIIIIIIIIIIIIIIIIIIIMIMIIIilllllllMIM 

VSVTAYIALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSA 
iGH VNS T I KELRRLFL VDDL VDS LKFAVLMWVFT YVGALFNGLTLL I LALI SLFS I PVI Y 

II 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 !h 1 1 1 1 IN 1 1 1 1 1 1 1 1 M 1 1 M I hi I! 

tGHVNCTI KELRRLFLVDDLVDS LKFAVLMWVFT YVGALFNGLTLL I LALI SLFSVPVI Y 
iRHQVQ I DH YLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

III M = 1 1 M 1 1! 1 : 1 1 1 1 1 1 1 ! I II 1 1 : 



RESULT 10 
US-09-765-205-6 

; Sequence 6, Application US/09765205 
; Patent No. US20020034800A1 
; GENERAL INFORMATION: 
; APPLICANT: Cao , Li 

; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 
; FILE REFERENCE: 14 58-004/200130.449 
; CURRENT APPLICATION NUMBER: US/09/765,205 
CURRENT FILING DATE: 2001-01-17 
PRIOR APPLICATION NUMBER: US/09/2 12 , 440 
; PRIOR FILING DATE: 1998-12-16 
; NUMBER OF SEQ ID NOS : 46 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 
TYPE : PRT 
ORGANISM: human 
US-09-765-205-6 



Query Match 



20.4%; Score 1191; DB 9; Length 373; 



Best Local Similarity 27.8%; Pred. No. 1.4e-52; 

Matches 32 7; Conservative 12; Mismatches 20; Indels 818; Gaps 7; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll Mill llllll IIIIMII llllll 1 1 1 1 : M ^ 1 1 1 1 1 1 M 1 1 1 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

IIIIMII II III llhll = 1 IIIIIMIIIIIII IIIMII || : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III II II IIIIIIIIIIIIIIIIIIM II II lllllllllllll 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVI WSKDKEDLVCSAALHSPQESPVGKEDRVVSPEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVIAARANVESKVXiRKCLED 4 06 

Db ' 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS 466 

Db 183 182 

Qy 467 ENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526 

Db 183 182 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586 

Db 183 182 

Qy 587 VLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 646 

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 

Db 183 182 

Qy 7 07 EHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 

Db 183 182 

Qy 767 PQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDK 826 



Db 



183 



182 



Qy 


827 


Db 


183 


Ov 


887 


Db 


183 


Ov 


947 


Db 


186 


Ov 


1007 


Db 


217 


Qy 


1067 


Db 


277 


Qy 


1127 


Db 


337 



182 



-GSV-- 185 



Mlllllllllllllllllllllllllllll 

- WDLLYWRDI KKTGWFGASLFLLLSLTVFS 216 



IIMIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIII 

VSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 
jGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS IP VI Y 

INI llllllllllllllllllllllllllllllllllllllllllllllllhllll 

iGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSVPVI Y 

IRHQVQ I DHYLGIANKS VKDAMAKI QAKI PGLKRKAD 1163 
Ml llllllllllhllllllllllllllllllh 



RESULT 11 
US-09-893-348-24 

Sequence 24, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: E I SENBACH- SCHWARTZ , Michal 
COHEN, I run R. 
BESERMAN, Pierre 
MOSONEGO, Alon 
MOALEM , Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS , NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/8 93 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 19 99-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER : PCT/US98/ 14 715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 24 
LENGTH: 373 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-24 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Query Match 20.4%; Score 1191; DB 9; Length 373; 

Best Local Similarity 27.8%; Pred. No. 1.4e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Mhlll Mill MINI MINIM Mill! 1 1 1 1 h I h M 1 1 II 1 1 1 1 1 1 1 L 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 
Qy 61 PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

MM I I III 1 1 h 1 1 M I M II I M II M II 1 1 II I II h II = I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 
Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II lllllllllllll 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 
III 

Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED 406 

Db 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS 466 

Db 183 182 

Qy 467 ENKTD E KKI E ERKAQ 1 1 TE KTS P KTSN P F LVAVQDS EAD Y VTTDTLS KVTEAA VSNM P EG 526 

Db 183 182 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586 

Db 183 182 

Qy 587 VLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 646 

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 

Db 183 182 

Qy 707 EHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 



Qy 



767 PQELGKPYLESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDK 826 



Db 


183 


Qy 


827 


Db 


183 


Qy 


887 


Db 


183 


Qy 


947 


Db 


186 


Qy 


1007 


Db 


217 


Qy 


1067 


Db 


277 


Qy 


1127 


Db 


337 


RESULT 


12 


US-10- 


060-036 



182 



182 



GSV 185 

EAEKKLPSDTEKEDRSLSAVLSAELSKTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS 1006 

IMIIIIIIIIIIIIIIIIMIIIIIIIIII 

WDLLYWRDI KKTGWFGASLFLLLSLTVFS 216 

IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 106€ 

MIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMMII 

IVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSA 276 
LGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI Y 1 12 € 

Mill llllllllllllllllllllllllllllllllllllllllllllllllhllll 

LGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS VPVI Y 336 

ERHQVQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 
I I I I I 



Sequence 72, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT: Benson, Darin R. 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes , Michael J . 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/ 10/ 06 0 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS: 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 72 
LENGTH: 3 73 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-060-036-72 

Query Match 20.4%; Score 1191; DB 15; Length 373; 

Best Local Similarity 27.8%; Pred. No. 1.4e-52; 

Matches 327; Conservative 12; Mismatches 20; Indels 818; Gaps 7; 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

NIMH Mill MINI MINIM MINI I II I h I h : 1 1 1 1 1 1 II I II 1 1 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFWEPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

MINIM II III MINI = I I M I.I lllllllh II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

IN INI NIIIIIIIIIIIMIMM N II Mill: 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Db 179 RGSS 182 

Qy 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Db 183 182 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVIVRSraKEDLVCSAALHSPQESPVGKEDRWSPEK 346 

Db 183 182 

Qy 347 TMDIFNEMQMSWAPWEEYADFKPFEQAWEVraTYEGSRDVLAARANVESKVDRKCLED 4 06 

Db 183 182 

Qy 407 SLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS 466 

Db 183 182 

Qy 467 ENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEG 526 

Db 183 182 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSP 586 

Db 183 182 

Qy 587 VLPDI VMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALK 646 

Db 183 182 

Qy 647 ALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVP 706 

Db 183 182 

Qy 707 EHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSAS 766 

Db 183 182 

Qy 767 PQELGKPYLESFQPNLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDK 826 

Db 183 182 

Qy 827 IKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 



Qy 


887 


Db 


183 


Qy 


947 


Db 


186 


Qy 


1007 


Db 


217 


Qy 


1067 


Db 


277 


Qy 


1127 


Db 


337 



Ih 

-GSV 185' 



1 1 II I !! Ill Ml II II I II Ml IL 1 1 

WDLLYWRDI KKTGWFGASLFLLLSLTVF' 

I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSi 

IMMMIIIMIMMMMMIIIIMIIMMIIMMMIIIMMIMIIMM 

I VSVTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSJ 
LGHVNSTI KELRRLFLVDDLVDSLKFAVLMW VFTY VGALFNGLTLL I LAL I SLFS I PVI 1 

Mill HI I I I I I I i : I ! U I I M : | |;| , M 

LGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 1 
ERHQVQI DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

I I I I MllllllllhllllMllllllllllllh 



RESULT 13 
US-09-893-348-21 

Sequence 21, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



E I S ENBACH - S CHWARTZ , Michal 
COHEN, I run R. 
BESERMAN, Pierre 
MOSONEGO, Alon 
MOALEM , Gila 



TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM -SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/0 9/8 93 , 34 8 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER : US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 21 
LENGTH : 19 9 
TYPE : PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-21 



Query Match 15.8%; Score 925; DB 9; Length 199; 

Best Local Similarity 99.0%; Pred. No. 1.7e-39; 



Matches 189; Conservative 0; Mismatches 2; Indels 0; Gaps 



0; 



Ov 


973 


Db 


Q 


Ov 


1033 


Db 


69 


Qy 


1093 


Db 


129 


Qy 


1153 


Db 


189 



I IMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIMIIIIIIIII 

KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 
QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIMIMIIIIIIIIMIII 

QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 
AVLMWVFTYVGALFNGLTLLI LAL I SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI Q 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIII 

AVLMWVFTYVGALFNGLTLLI LAL I SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI Q 
AKI PGLKRKAD 1163 

MINIMI 



RESULT 14 
US-09-893-348-25 

Sequence 25, Application US/09893348 
Patent No. US2 00200724 93A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS -SCHWARTZ-2A 
CURRENT APPLICATION NUMBER: US/09/8 93 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98 / 14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patent In vers ion 3 . 1 
SEQ ID NO 25 
LENGTH: 199 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-893-348-25 

Query Match 15.5%; Score 908; DB 9; Length 199; 

Best Local Similarity 96.3%; Pred. No. 1.2e-38; 

Matches 184; Conservative 3; Mismatches 4; Indels 0; Gaps 0; 
Qy 973 KTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYI ALALLSVTI SFRI YKGVI 1032 

I I IMIMM Ml IMMMmMIMI MMMIM MIMMMMMM 

Db 9 KDKWDLLYWRD I KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI 68 



Qy 


1033 


QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHWSTI KELRRLFLVDDLVDSLKF 

lilllllllllllllMlllilllMIIIIIIIIIIIII lllllllllllllllllill 


1092 


Db 


69 


QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCT I KELRRLFLVDDLVDSLKF 


128 


Qy 


1093 


AVLMWVFTY VGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKS VKDAMAKI Q 


1152 


Db 


129 


i i i i i i i i i i i i iniiiiiiihiiiiiiiii 

AVLMWVFTYVGALFNGLTLLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 


188 



Qy 1153 AKI PGLKRKAD 1163 

lllllllllh 
Db 18 9 AKI PGLKRKAE 19 9 



RESULT 15 
US-09 -789-386-4 

; Sequence 4, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

FILE REFERENCE: GP-30165-C1 
; CURRENT APPLICATION NUMBER: US/ 09/78 9 , 3 8 6 
; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 
; PRIOR FILING DATE: 1999-07-19 
; PRIOR APPLICATION NUMBER: U.K. 9816024.5 
; PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359,208 

PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 4 

LENGTH: 28 9 
TYPE : PRT 

ORGANISM: HOMO SAPIENS 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: (31) (138) 
US-09-789-386-4 



Query Match 15.2%; Score 888; DB 9; Length 28 9; 

Best Local Similarity 63.9%; Pred. No. 2e-37; 

Matches 184; Conservative 34; Mismatches 58; Indels 12; Gaps 5; 
Qy 195 MDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIE 254 

III II III MMI Mllllllllllll lllllll MM llllll III- 

Db 3 MDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 62 

Qy 255 ETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVR 314 

I -MM : |:| ::||| 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml Mh:| I MMM 

Db 63 ENVSEASKEVSEKAKTLLI DRDLTEFSELEYSEMGSSFSVSPKAESAVI VANPREEI I VK 122 
Qy 315 S KDKED - LVCSAALHS PQES PVG KEDR WS PEKTMD I FNEMQMS WAPVREE YAD 368 

Mhh II : II II I III Ml II MMM 

Db 123 NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 182 



Qy 369 FKPFEQAWEVKDTYEGSRDVLAA RANVESKVDRKCLEDSLEQKSLGKDSEGRNEDA 424 

llllh I II h I I hill I I I I I I I I Mill : I I I I hi 

Db 183 FKPFERVWEVKDSKEDS-DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDT 241 

Qy 425 SFPSTPEPVKDSSRAYITCASFT-SATESTTANTFPLLEDHTSENKTD 471 

I I I I M I : I I I I I I I I I I HIM I I I I I I I I I E I I I 
Db 242 SFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTD 289 



Search completed: December 19, 2003, 15:42:41 
Job time : 45 sees 



GenCore version 5.1.6 
Copyright (c) 19 93 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



December 19, 2003, 15:33:28 ; Search time 27 Seconds 

(without alignments) 
4142.377 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP VKDAMAKI QAKI PGLKRKAD 1163 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters: 



283308 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



Database : 



PIRJ76:* 
1: pirl:* 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

Result Query 

No. Score Match Length DB ID Description 



1 


789.5 


13 


.5 


776 


2 


A46583 


neuroendocrine - spe 


2 


'685 


11 


. 7 


208 


2 


160904 


neuroendocrine- spe 


3 


671 


11 


5 


267 


2 


A60021 


tropomyosin- relate 


4 


517 


8 


8 


2484 


2 


T26216 


hypothetical prote 


5 


503 .5 


8 


6 


2607 


2 


T26215 


hypothetical prote 


6 


328.5 


5 


6 


5327 


2 


T13564 


microtubule -associ 


7 


322 


5 


5 


7962 


2 


138346 


elastic titin - hu 


8 


320 


5 


5 


222 


2 


T26213 


hypothetical prote 


9 


304 .5 


5 


2 


873 


2 


A47283 


calphotin - fruit 


10 


302.5 


5 


2 


1829 


2 


T24583 


hypothetical prote 


11 


295.5 


5 


1 


865 


2 


A47282 


calcium-binding pr 


12 


292 


5 


0 


3507 


2 


T34513 


hypothetical prote 


13 


291 . 5 


5 


0 


2364 


2 


A56577 


microtubule -associ 



14 


288 . 5 


4 


. 9 


971 


2 


T19431 


hypothetical prote 


15 


284 


4 


.9 


2464 


1 


QRMSP1 


microtubule-associ 


16 


281 


4 


. 8 


3488 


2 


T34418 


hypothetical prote 


17 


279.5 


4 


. 8 


1262 


2 


T22523 


hypothetical prote 


18 


277 


4 


. 7 


1621 


2 


A82255 


hypothetical prote 


19 


275 . 5 


4 


. 7 


3924 


2 


S37431 


ankyrin 2 , neurona 


20 


275 


4 


. 7 


1299 


2 


T47182 


hypothetical prote 


21 


273 .5 


4 


. 7 


1029 


2 


T30351 


mucin-like protein 


22 


273 .5 


4 


.7 


1274 


2 


T16251 


hypothetical prote 


23 


273 


4 


. 7 


1558 


2 


B71603 


RESA-H3 antigen PF 


24 


272 . 5 


4 


. 7 


3534 


2 


T42567 


tegument protein 2 


25 


270 


4 


.6 


1230 


2 


T22458 


hypothetical prote 


26 


269.5 


4 


.6- 


2187 


2 


T30826 


nascent polypeptid 


27 


267 


4 


. 6 


1684 


2 


JW0057 


gravin - human 


28 


265.5 


4 


.5 


1828 


2 


A40115 


microtubule-associ 


29 


2 63 


4 


. 5 


1825 


2 


S13507 


microtubule-associ 


30 


261.5 


4 


. 5 


1087 


1 


QFMSH 


neurofilament trip 


31 


261 


4 , 


. 5 


606 


2 


A43427 


neurofilament trip 


32 


261 


4 . 


. 5 


2570 


2 


T17451 


fimbriae- associate 


33 


260 


4. 


. 4 


1804 


2 


T34518 


nestin - golden ha 


34 


259 . 5 


4 . 


.4 


1020 


1 


QFHUH 


neurofilament trip 


35 


259 


4 . 


.4 


1510 


2 


T33100 


hypothetical prote 


36 


258 . 5 


4 . 


.4 


1830 


2 


A37981 


microtubule-associ 


37 


257 


4 . 


.4 


1224 


2 


T14007 


microtubule-associ 


38 


256 


4 . 


.4 


2361 


2 


T25752 


hypothetical prote 


3 9 


254 . 5 


4 . 


. 4 


6642 


2 


T29757 


protein UNC-89 - C 


40 


254 


4 . 


.3 


1616 


2 


G64242 


cyt adherence -acces 


41 


254 


4 . 


.3 


3381 


2 


T42389 


versican precursor 


42 


253 


4 . 


3 


3421 


1 


WZBEB6 


367K tegument prot 


43 


252 .5 


4. 


3 


5170 


2 


T15348 


hypothetical prote 


44 


251 


4 . 


3 


1824 


1 


QRHUMT 


microtubule-associ 


45 


250.5 


4 . 


3 


4377 


2 


A55575 


ankyrin 3, long sp 



ALIGNMENTS 



RESULT 1 
A46583 

neuroendocrine-specif ic protein, splice form A - human 

N; Contains : neuroendocrine-specif ic protein, splice form B 

C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: A46583; 160903 

R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A. ; Broers, J.L.; Ramaekers, 
F.C. ; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A/Reference number: A46583; MUID : 93293865 ; PMID: 7685762 
A;Accession: A46583 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-776 <R0E1> 

A; Cross-references: GB:L10333; NID:g307306; PIDN : AAA59950 . 1 ; PID:g307307 
A; Access ion: 160 903 



A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A/Residues : 421-776 <R0E2> 

A; Cross-references: GB:L10334; NID:g307308; PIDN : AAA59951 . 1 ; PID:g307309 

C;Genetics: 

A; Gene: GDB : RTN1 ; NSP 

A; Cross-references : GDB:203968; OMIM:600865 
A ; Map position: 14q21-14q22 

Query Match 13.5%; Score 78 9.5; DB 2; Length 776; 

Best Local Similarity 31.2%; Pred. No. 3.4e-25; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 

Qy 487 TS PKTSN P FLVAVQDS EAD YVTTDTLS KVTEAAVSNM PEGLT PDLVQEACES ELNEATGT 54 6 

: I : I M- ■ ■■ I = I II :| : I 

Db 65 SGPARQSP- -VAMETASTGVAGVSSAMDHTFSTTSKDGEG SCYTSLI S 110 

Qy 547 KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI - -VMEAPLNS 599 

I I ■ I : :| h : | | | || | |||: : | | 

Db 111 DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGIESRGLFSSDSG 167 

Qy 600 - -LLPSAGASWQPSVSPLEAPPPVSY DSIKLEPENPPPYEEA M 641 

Db 168 IEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 227 

Qy 642 NVALKALGTKEGI KEPE S FNAAVQETEAPY I S I ACDL I KETKLSTE - PS P 690 

lh = ll= = : I I I I I II =| = : = =| ■ 

Db 228 DISIK PEGVREPDKPAPVEGKI I KDHLLEESTFAPYI D DLSEEQRRAPQI TTP 280 

QY 691 DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSI PEV PQTQEEAVMLMK 745 

: I I I :: :|| : : | | |::| | |: 

Db 281 VKITLTEIE PSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPEDDSPGSITPP 334 

Qy 74 6 ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 792 

I M I Ml :| =:| : I I : | | : : 

Db 335 SSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRPEVKARSGPPT 394 

Qy 793 I PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI 827 

Ih II:: : I I : : II : 

Db 395 I PSPLDHEASSAESGDSEI ELVSEDPM - - AAEDALPSGYVSFGHVGGPPPS PAS PS I QYS 452 

Qy 828 KESETFSDSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEI ANIQSG 879 

: I I II hi I II -III :: :: | | 

Db 453 I LREEREAELDS ELI I ESCDASSASEESPKREQDSPPMKPSALDAI REETGVRAEER 509 

Qy 880 ADSLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIV 939 

II II II ! II I I I I II hi I 

Db 510 APSRRGLAEPG- -SFLD- YPSTEPQPGPEL PPGDGALEPETPM 549 

Qy 94 0 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LSKTSWDLLYWRD 984 

M I- | : : : |:| :||||||| 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAI DLLYWRD 597 

Qy 985 I KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS VTI SFRI YKGVI QAI QKSDEGHPF 1044 

IMhllh I II III Ihlll Ihlll II IIIIMII HMNIIIIl 

Db 598 IKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPF 657 



Qy 1045 RAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

: I I I I h MM , MIIIMIIIIMM llllllllllllh Mill 

Db 658 KAYLELE I TLSQEQI QKYTDCLQFYWSTLKELRRLFLVQDL VDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

I I I I I M I : : I : : I : I : : I I : I :|| ||| |||| = MIIMIMI || |: 

Db 718 LFNGLTLLLMAWSMFTLPVVYVKHQAQI DQYLGLWTHINAWAKI QAKI PGAKRHAE 776 



RESULT 2 
160904 

neuroendocrine-specif ic protein C - human 
C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C /Access ion: 160904 

R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A. ; Broers , J.L.; Ramaekers, 
F.C. ; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A; Reference number: A46583; MUID : 93293865 ; PMID: 7685762 
A /Access ion: 160904 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A /Molecule type: mRNA 
A/Residues : 1-208 <RES> 

A; Cross-references: GB:L10335; NID:g307310; PIDN : AAA59952 . 1 / PID:g307311 
C; Genetics: 

A /Gene: GDB : RTN1 / NSP 

A/Cross-references: GDB: 203968/ OMIM : 600865 
A/Map position: 14q21-14q22 



Query Match 11.7%; Score 685/ DB 2; Length 208; 

Best Local Similarity 67.0%; Pred . No. le-21; 

Matches 128; Conservative 32; Mismatches 31; Indels 0; Gaps 0; 



Qy 


973 


KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 


1032 


Db 


18 


h 1 1 1 1 H 1 1 II Ml Ihlll Ihlll II MINIM h 

KSQAIDLLYWRDIKQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATISFRI YKSVL 


77 


Qy 


1033 


QAIQKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKF 


1092 


Db 


78 


MMIMIMMMMI h :h| :|||:: = | | | | : | || | || | | | |||||||| 
QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 


137 


Qy 


1093 


AVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI Q 
M II h 1 1 1 1 1 1 1 1 1 1 1 1 1 : : h : h h : 1 1 : 1 :|| ||| MM : Mill 


1152 


Db 


138 


A VLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQI DQYLGLVRTHI NAWAKI Q 


197 


Qy 


1153 


AKI PGLKRKAD 1163 




Db 


198 


Mill M h 

AKI PGAKRHAE 2 08 





RESULT 3 
A60021 

tropomyosin-related protein, neuronal - rat 
C; Species: Rattus norvegicus (Norway rat) 



C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999 

C; Access ion: AGO 021 

R;Wieczorek, D.F.; Hughes, S.R. 

Brain Res. Mol . Brain Res. 10, 33-41, 1991 

A; Title: Developmentally regulated cDNA expressed exclusively in neural tissue. 
A;Reference number: A60021; MUID : 91278684 ; PMID : 1647480 
A, -Access ion: A60021 
A; Molecule type: mRNA 
A;Residues: 1-267 <WIE> 

A; Cross-references : EMBL:X52817 ; NID:g456549; PIDN : CAA37001 . 1 ; PID:g456550 
C; Comment: This neuronal -specif ic mRNA was identified by hybridization to an 
alpha -tropomyosin probe but does not show homology in amino acid sequence. 

Query Match 11.5%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 5.4e-21 ; 

Matches 124; Conservative 33; Mismatches 30; Indels 0; Gaps 0; 



Qy 973 KTSWDLLYWRDI KKTGWFGASLFLLLSLTVFSI VS VTAYIALALLSVTI SFRI YKGVI 1032 

h Hlllllllhlhlll: I || Ml IhMI ||:||| || MMIIII h 
Db 9 KSQAIDLLYWRDI KQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATISFRI YKSVL 68 

Qy 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

IhlhllMlhllll h :hl :||h: = | | | | = | | | | | | | | | MMIIII 

Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 

Qy 1093 AVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKS VKDAMAKI Q 1152 

11111= lllllllllllll-l-H-lhl :|| hi I | | | : :|||| 

Db 12 9 AVT.MWLLTYVGALFNGLTLLLMAWSMFTLPVWVKHQAQVDQYLGLWTHINTWAKIQ 18 8 

Qy 1153 AKIPGLK 1159 

Mill 

Db 18 9 AKIPGAR 195 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T26216 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A; Access ion: T2 6216 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-2484 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB51467 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3c 

A; Experimental source: clone W06A7 

C; Genetics : 

A; Gene: CESP:W06A7 .3c 

A ; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 8.8%; Score 517; DB 2; Length 2484; 

Best Local Similarity 20.1%; Pred. No. 2.6e-13; 

Matches 262; Conservative 189; Mismatches 413; Indels 440; Gaps 51; 



Qy 28 FVTEPEDEE DEEEEEDEEED DEDLEELEVL 57 

I II :| I = | I I I I llh h | 

Db 1437 FGTESSEESQKADGNQENQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDV 1496 

Qy 58 ERKPAAGLSAAAVPPAAAAPL-LDFSSDSVPPAPRGPLPAAPPAAP- -ERQPSW 108 

h : I I h : I III-! I hi II I I 

Db 1497 DASDVNEQDEESTLKILKWPSEPSLLELDFTND- -PKVI HVPI PLMEPATMYLEEMVEW 1554 

Qy 109 ERSPAAPAPSLPPAAAVLPS KLP-EDDEPPARPPPPPPA 146 

Db 1555 IIADAVKEVSEMEWTESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPW 1614 

Qy 147 GAS PLAE PAAP PSTPAAPKRRGSGS VDETLFALPAAS E P VI PS - SAEKI MDLMEQPGNTV 2 05 

I I hi I II =:|= I I = I : I I 

Db 1615 QVQERI IPIEVEQAPTI PQRPPRAPKSE LPKVAKPLDDSKSRVRFAPLNI KLGRTY 1670 

Qy 2 06 SSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELP 2 65 

I h III Ih - :: | 

Db 1671 SEEQQK ELVESLE- -RPLTI IT QQKPP 1695 

Qy 266 ERAT NPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVI VRSKDKE 319 

Db 1696 EKPTED I GALS PLS PNTLAE YEEVPMMDMQS 1726 

Qy 320 DLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQ-MSWAPVREEYADFKPFEQAWEV 378 

Mill h: : I :| I I I I II I I I 

Db 1727 VPHSPQE KQEEI EALSEI IEEPQAMKEVE KPVESAPE- 1763 

Qy 379 KDTYEGSRDVLiAARANVESKVDRKCLEDSL- - EQKSLGKDSEGRNEDASFPSTPEPVKDS 436 

II : | | : : | :| : ||| :| |:| 

Db 1764 KD NESLEAPEI INEPIRRVLVETKIMGPGKSLNED NDD 1801 

Qy 437 SRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKAQI I TEKTSPKTSNPFL 496 

■■\ \\ ■ II II 

Db 1802 DDDGS ECLDS I GDLS ER TIQRFN 1824 

Qy 4 97 VAVQDS EADYVTTDTLS KVT EAAVSNMPEGLTPDLVQEACESELNEATGTKI 548 

: : I = h I - h h : I I 
Db 1825 TS I DDPS 1 RRDSFSS I SSFGDRQKFRTAI EN I RQDLLP 1862 

Qy 54 9 AYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASV 6 08 

= = = I h - I h II - - hH : I hi 
Db 1863 -FQSSV- - - - -SQYLRSSPNPSQQL- - - - - -LVTNLSMDSP-SDLSPNA 1898 

Qy 609 VQPSVSPLEAPPPVSYDSI KLE P EN P P P YE EAMNVALKALGTKEG I KE P ES FNAA 663 

Db 1899 PPVGFENTAQFLEKLQQEDRPSAEGS I D SSGFEKVD- -HEG 1937 

Qy 664 VQETEAPYI S IACDLI KETKLSTEPSPDF SNYSEIAKFEKSVPE 707 . 

1 I M I = = = = I I =| = : I H 

Db 1938 LDEFAAPPVH DPMQKSVFGSLGSDDMKPGSQDDGFVFI ERNEANEATLKKNQKMSSH 1994 

Qy 708 HAELVEDS SPESEPVDLFSDDS I PEVPQTQEEAV MLMKESLTE 750 

I =-l ■ II :| I- : ■ ■ \ \ : :| | 

Db 1995 HNDVI EKNYFNDNAPTAALLES P I AEEARKLVQDAVESAS E YKKQAVDSGDE I GRELLDN 2054 



Qy 


751 


VSETVAQHKE ERLSAS POELGKPYLESFOP 

1 : = 1 II 1 I: :h :| 


780 


Db 


2055 


VEQKIEQVKEPI VDSLHKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPVPEKIETPEP 


2114 


Qy 


781 


- - NLHSTKDAASND I PTLTKKEKI SLQMEEFNTAI YSNDDLLS SKEDK I KES 


ft 7 0 

O O \J 


Db 


2115 


-1 1 1 = = = -1 1 : Ih :|| 

LVDIHDTVDKVHDEVDNFLRREP TPPFETDDVAPT.^nnKPOFGNOTPFFnF 




Qy 


831 


ETFSDSS P I E 1 1 DEFPTFVSAKDDSPKLAKEYTDLEV^DK^RT AMT nc;n a n^T.PPT . 


A ft £ 


Db 


2166 


TTFDRKGPLTI PEEVE KAAAAQNND LDDFDPLVTSNTGAAFGAAVGAAAAVESLTEE 


Z. j£ j6 


Qy 


887 


ELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKASI SPSNV^A - T.FPOTFM 


QOC 
> J J 


Db 


2223 


h h : Ml :||| : || |: | : 
EMFGHQKFETVPRPPTPPKD ISDE DVKPSTVNLGPSHHHSHPSSPH 


ZZDO 


Qy 


936 


GS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDI KKTGWFGAS 
M l 1 = ■■ hl-llll Ih :| = 


_/ _? 3 


Db 


2269 


HSILKHHG DAWIDFKTVPPPVT.nVT YWRnAKK^A TVT.CJT.A 




Qy 


996 


LFLLLSLTVFSIVSVTAYIALALLSVTI SFRTYKOVTOATOK^rJFfiHPFRAYT.FC! wn t c; 

1 : 1 1 : » = 1 1 1 ll-l 1 hhl III 1 - : 

LLVLFVLAKYPLLTVVTYSLLLALGAAAGFRVFKKVEAQI KKTDSEHPFSEI LAQDLTLP 




Db 


2309 


9 7 fift 


Qy 


1056 


EELVQKYSNSALGHVNSTI KELRRLFL VTOLVDSLKFAVLMWVFTYVGALFNGLTLLI LA 


1115 


Db 


2369 


= 11 - ' 1 = 1-1 1 = ::hll = : = | Ih = hi || || 
QEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFGLVLWSLTYIASWFSGFTLAILG 


2428 


Qy 


1116 


LI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLK 115 9 
h Hhl :|| :| II :| : :|: 1 hi h 




Db 


2429 


LLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDEKLPFLR 2472 





RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C; Species : Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T26215 
R;Ains cough, R . 

submitted to the EMBL Data Library, August 1996 
A/Reference number: Z20173 
A; Access ion : T26215 

A ; Status: preliminary; translated from GB/EMBL/DDBJ 
A /Molecule type: DNA 
A;Residues: 1-2607 <WIL> 

A;Cross-references : EMBL: Z7 8 066; PIDN : CAB01522 . 2 ; GSPDB : GN00023 ; CESP : W06A7 . 3a 

A; Experimental source: clone W06A7 

C;Genetics : 

A; Gene : CESP: W06A7 . 3a 

A; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 

Query Match 8.6%; Score 5 03.5; DB 2; Length 2607; 

Best Local Similarity 20.8%; Pred. No. le-12; 

Matches 280; Conservative 181; Mismatches 481; Indels 407; Gaps 52; 



28 FVTEPEDEE DEEEEEDEEED DEDLEELEVL 57 

III : I I U I I I I llh I I 

1437 FGTESSEESQKADGNQENQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDV 14 96 

58 ERKPAAGLSAAAVP PAAAAPL - LDFSSDS VP PAPRGPLPAAP PAAP - - ERQPSW 108 

h : I I |: : I lll-l I hi II I.I 

14 97 DASDVNEQDEESTLKI LKWPSEPSLLELDFTND - - PKVI HVPI PLMEPATMYLEEMVEW 1554 

109 ERSPAAPAPSLPPAAAVLPS KLP-EDDEPPARPPPPPPA 14 6 

I = II = = M Ml MM I I I 

1555 IIADAVKEVSEMEWTESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPG 1614 

147 GASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPS-SAEKIMDLMEQPGNTV 205 

I" I hi I II = = h ||: I : I | 

1615 QVQERI IPIEVEQAPTIPQRPPRAPKSE LPKVAKPLDDSKSRVRFAPLNI KLGRTY 1670 

2 06 SSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELP 265 

I h III Ih - :: | 

1671 SEEQQK ELVESLE- -RPLTI IT ___qqkpp 1695 

266 ERAT NPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKE 319 

I : I : I MM h =| I II : :\\: |: \ 

1696 EKPTEDI GALS PLS PNTLAE YEE VPMMDM - QS VPHS P QEKQEEIEALSEI IE 1746 

320 DLVCSAALHS PQES - PVGKEDR WS PEKTMD I FNEMQMS WAP VREEYADFK PFEQA 375 

■ I II I Ill hi : I I : 

1747 EPQAMKEVEKPVESAPEKDNESLEAPE IINE PIRRVLVETKIMGPGKSL 1795 

3 76 WEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDS E 418 

I I : : | : : : : : : |:: |: :|| | 

1796 NEDNDDDDDGSECLDS I GDLSERTI QR- FNTS I DDPS I RRDSFSS I SSFGDRQKFRTAI E 18 54 

419 GRNED ASFPSTPEPVKDSSRAYITCASFTSATESTT ANTFPLLE 462 

: I hi I I : M I I - : II II 

1855 NI RQDLLPFQSSVSQYLRSSPNP - - - SQQLLVTNLSMDSPSDLS PNAP P VGFENTAQFLE 1911 

463 DHTSENKTDEKKI EERKAQI I TEKTSPKTSNPF 495 

1912 KLQQEDRPSAEGS I DS SGFEKVDHEGLDEFAAPPVHDPMQKS VFGSLGSDDMKPGSQDDG 1971 

4 96 LVAVQDS EAD YVT TDTLSK VTEAAVSNMPEGLTPDLVQ 533 

I := =lh I I : I : h - llh 

1972 FVFI ERNEANEATLKKNQKMSSHHNDVI EKNYFNDNAPTAALLESPIAEEARKLVQDAVE 2 031 

534 EACE SELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVL 588 

I I : : | :: : : | | | :||: | | |: | 

2032 SAS E YKKQAVDSGDE I GRELLDNVEQKI EQVKEP I VDSLHKAYDGVGDFVH - ETVPNAV - 2 08 9 

589 PDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKAL 64 8 
2 090 DDFVREAE KQLPESPVP 2106 

649 GTKEGI KEPESFNAAVQETEAPYI S IACDLI K ETKLSTEPSPDFSNYSEIAKFEK 703 

I h II I : | : | : | ||:| | 
2107 EKIETPE- PLVDIHDTVDKVHDEVDNFLRREPTPPFE 2142 



Qy 


704 


SVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLT EVSETVAQHKE 

: 1 : 1 1 : 1 = : M ::| III 11=1 

TDDVAPLSDDKPOFGNOT-PE EDETTFDRKGPI.TT PFFVFKAAAAONN 


760 


Db 


2143 


-L O Z> 


Qy 


761 


ERLSASPQELGKPYLESFQPNLHSTKDA ASNDI PTLTKKEKI SLQMEEFNTAI 


ft 1 ^ 


Db 


2190 


h 1 1 : 1 1 h : :||::| | :| | 

D LDDFDPLVTSNTGAAFGAAVGAAAAVESLTEEEMFGHQ- -KFETVP 


A o *± 


Qy 


814 


YSNDDLLS S KEDKI KES ETFSDSS P I E II DEFPTFVSAKDDS P - - KLAKE YTDLEVSDKS 


o / J. 


Db 


2235 


: : : 1 1 \: : || : | : | : h 

RPPTPPKDISDEDVKPS TVNLGPSHHHSHPSSPHH^ I LKHHGDAWT DFKT 




Qy 


872 


EIANIQSG ADSLPCLE LPCDL^FKNT YPKDFVH 


Q 04 

JUT 


Db 


2285 


h ■■ III h 1 II 1 :| 

VPPCAQNAFSPGEIMFLLAFFVYLSCFASFFSKSLPLLDNLLSLWYLSISLI - IH 


-J J5 Z7 


Qy 


905 


VSDE-- FSENRSSVSKASISPSNVSA LEPOTEMGSI VK^K^T.TKFAFK 


q ro 


Db 


2340 


1 1 -HI : 1 1 : :| III 

VKHHR KFRWN E EQATTMS KLGAVGRGLYAL I AF I VN I VLR VGLNVAL WG VAVSAHEA YK 


2399 


Qy 


951 


KLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSV 


1010 


Db 


2400 


1 ill h|::|||| lh :| :| =| I : : = = | 
LTKS SGVLRKKEVLDVI YWRDAKKSAI VLSLALLVLFVLAKYPLLTV 


2446 


Qy 


1011 


TAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHV 
1 1 1 1 I.I.I III 1 .... 1 I i 


1070 


Db 


2447 


1 1 ' M - - I I I • I * - - * * 1 1 - • : | 

VTYSLLLALGAAAGFRVFKKVEAQIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHA 2506 


Qy 


1071 


NSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQ 

. 1 . . 1 1 ..I.M...I 11.11111:1 


1130 


Db 


2507 


' 1 ' * 1 1 • * • 1 • 1 1 * • • 1 1 1 - - 1*1 I 1 || | : : | | : | Ml : 1 
TCIANKLKKLVFVESPLES I KFGLVLWSLTYIASWFSGFTLAI LGLLGVFSVPKVYESNQ 2566 


Qy 


1131 


VQI DHYLGLANKSVKDAMAKI QAKI PGLK 1159 




Db 


2567 


II = 1 = = h 1 hi h 

EAIDPHLATISGHLKNVQNI IDEKLPFLR 2 595 





RESULT 6 
T13564 

microtubule-associated protein homolog - fruit fly (Drosophila melanogaster) 
N;Alternate names: hypothetical protein EG:49E4.1 
C; Species: Drosophila melanogaster 

C;Date: 13-Aug-1999 #sequence__revision 13-Aug-1999 #text__change 17-Nov-2000 
C;Accession: T13564 

R;Spanos, L.; Papagiannakis , G. ; Siden-Kiamos , I. ; Louis, C. 
submitted to the EMBL Data Library, April 1999 

A; Description: Sequencing the distal X chromosome of Drosophila melanogaster. 
A; Reference number: Z1768 9 
A; Access ion : T13564 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-5327 <SPA> 

A; Cross-references : EMBL:AL031128 ; PIDN: CAA20006 . 1 
C; Genetics : 

A; Cross-references : FlyBase : FBgn0025392 

A;Introns: 24/2; 52/3; 104/3; 179/1; 232/1; 1669/3; 2566/1; 4798/3; 5272/1 
A;Note: EG:49E4.1 



C;Superfamily : Drosophila 576K microtubule-associated protein homolog 

Query Match 5.6%; Score 328.5; DB 2; Length 5327; 

Best Local Similarity 22.7%; Pred . No. 4.1e-05; 

Matches 255; Conservative 176; Mismatches 453; Indels 238; Gaps 54; 

Qy 2 EDIDQSSLVSSSTDSPPRPPPAFKYQFVTEP- EDEEDEEEEEDEEEDDEDLEELEVL 57 

: : I : I I : I I I ■ I I HI h II J : I I 

Db 3196 DEADKSKEESRRESGAEKSPLASKE- -ASRPASVAESIKDEAEKSKEESRRESVAEKSPL 3253 

Qy 58 ERKPAAGLSAAA- -VPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAP 102 

I 1= == I I I : I Ml I : || : : | 

Db 3254 PS KEASRPTS VAKS VKDEAEKS KEESSRDS V- - AEKS PLAS KEASRPAS VAES VQDEAEK 3311 

Qy 103 ERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLA-EPAA 156 

h hll I : I hi I I :: I |||| : h 

Db 3312 SKEESRRES VAEKS PLAYKEASRP -ASVAES I KDEAEKSKEESRRESVAEKS PLASKEAS 3370 

Qy 157 PPSTPA APKRRGSGSVDETLFALPAAS-EPVI PSS -AEKIMDLMEQPGNTVSSG 208 

I- I 11= II I II I I : I I I : I h 

Db 3371 RPTSVAESVKDEAEKSKEESSRDSVAEKSPLASKEASRPASVAESVQDEAEKSKEESRRE 343 0 

Qy 209 QEDFPSVLLETAASLPSLSPLSTVSFKEHG YLGNLSAVSSSEGTI EETLNEAS 261 

M Mh I I | : : | | : |: 

Db 3431 SVAEKS PIAS KEASRPASVAESVKDDAEKS KEESRRES VAEKS PLAS KEASRPAS VAESV 3490 

Qy 262 KELPERATNPFVNRDLAEFSELEYSEMG--SSFKGSPKGESAILVENTKEEVIVRSKDKE 319 

h I- :|| I I I ■ \ I I | : ! :||| :| 
Db 3491 KDEAEKSKEESRRES VAEKS PLPS KEASRPTS VAESVKDEA EKSKEE SRRE 3541 

Qy 320 DLVCSAALHSPQES-PVGKEDRW-SPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWE 377 

Db 3542 S VAEKSSLASKEASRPASVAESVKDEAEKSKEESRRES VAEKS PLASKEAS -RPASVAES 3600 

Qy 378 VKDTYEGSRDV LAARANVESKVDRK- -CLEDSLEQKSLGKDSEGRNE 422 

III N::| =| - = II : : :|:: - ill 

Db 3601 VKDEAEKSKEVSRRESVAEKSPLPSKEASRPTSVAESVKDEADKSKEESRRESGAEKSPL 3660 

Qy 423 DASFP-STPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTS ENKTD - 471 

= 111 I I III : : I II I : || | h I 

Db 3 661 ASMEASRPTSVAESVKDETEK SKEESRRESVTEKS - PLPSKEASRPTSVAESVKDE 3 715 

Qy 472 -EKKIEERKAQI ITEKT S PKTSNPFLVAVQDS EADYVTTDTLSKVTEAAVSNMPEG - 526 

II M : : : lh I -I I I h : = h : I I I 

Db 3716 AEKS KEESRRES VAEKS PLAS KESSRP ASVAES I KDEAEGTKQESRRESMPESG 3769 

Qy 527 LTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYP 569 

II M: : I : hi I = |: = | 

Db 3770 KAESIKGDQSSLASKETSRPDSWESVKDETEKPEGSAI DKSQVASRPESVAVSAKD 3826 

Qy 570 TAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKL 629 

I h I I II II III:: Ihl I I 
Db 3827 EKSPLHSRPESVADKS PDASKEA SRSLSVAETASSPI EEGPRSIAD 3872 

Qy 630 EPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPS 689 

— I | :| | | : - : | == | | : | 

Db 3873 LSLPLNLTGEAKG-KLP-TLSSPIDVAEGDFL EVKAESSPR 3911 



Qy 
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PDFSNYSEIAKFEKSVPEH- -AELVEDSSPESEPVDLF -SDDSIPEV 
1 : 1:1:1: 1 - I • • • 1 1 1 ■ ■ ■ 


733 


Db 


3912 


1 1 i 1 I • 1 ' * * 1 1 1 • * * • • ■ ■ - • 

P- - AVLSKPAEFSQPDTGHTASTPVDEASPVLEEI EWEQHTTSGVGATGATAETDLLDL 


3969 


Qy 


734 


PQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASNDI 
: 1 : 1 1 I : 1 1 - 1 1 - 1 • - -| 1 1 " . 


793 


Db 
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'1*11 I I 1 * II* 1 * * 1 1 1 ■ ■ • ... 
TETKSETV- -TKQSETTLFETLTSKVESKVEVLESSV-KQVEEKVQTSVKQAETTVTDSL 


4026 


Qy 


794 


PTLTKKEKI SLQMEE FNTAI YSNDDLLSSKEDKI KESETFSDSSP 

1 1 1 1 1 1 • 1 1 • 1 1 I-II 


838 


Db 


4027 


1 1 1 1 11*1 1 - 1 1 * ■ • 1 • 1 1 
EQLTKKS - - SEQLTEI KS VLDTNI SNVTNLFSTAVETI EKKVQDVTEKVI EKAT 


4078 


Qy 


839 


I EI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI Y 
| : | | : : | : : : |' : | : : | | : | : | 


898 


Db 


4079 


-EHVSEHVT- -TTGESSTETSQEKSSLDLGTFSELRETHITTVGSPEFTV TIC 


4128 


Qy 


899 


PKDE- -VH-VSDEFSENR SSVSKAS ISPSNVSALEPQTE- MGSIVKS- 


941 


Db 


4129 


= 11 :| : :| hi | | ||: | | = | |= | : ::|| 
ERDEPVLHDIKEEDEEHRFSPPSDVDKAAI I PPQPMRPLSPREEEVAKI VADVAKVLKSD 


4188 


Qy 


942 


KSLT KEAEKKLPS - - DTEKE - DRSLSAVLSAELS 972 




Db 


4189 


KDITDI IPDFDERQLEEKLKSTADTEEESDKSTRDEKSLEIS 423 0 




RESULT 
138346 
elastic 


7 

tit in 


- human (fragment) 





C; Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000 
C; Access ion: 13 834 6 
R;Labeit, S.; Kolmerer, B. 
Science 270, 293-296, 1995 

A; Title: Titins : giant proteins in charge of muscle ultrastructure and 
elasticity. 

A; Reference number: A57430; MUID : 96026330 ; PMID:7569978 
A; Accession : 13 8346 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A/Residues : 1-7962 <RES> 

A; Cross -references : EMBL : X90569 ; NID : gl017426 ; PIDN : CAA62 18 9 . 1 ; PID:gl017427 

C;Genetics : 

A; Gene: GDB : TTN - 

A; Cross -references : GDB: 127867; OMIM: 188840 
A ; Map position: 2q31-2q31 

Query Match 5.5%; Score 322; DB 2; Length 7962; 

Best Local Similarity 21.7%; Pred. No. 0.00013; 

Matches 259; Conservative 146; Mismatches 416; Indels 370; Gaps 58; 

Qy 23 AFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVL ERKPAAGL 65 

11= ■ II h Mill l-l : I ih hll 

Db 6574 AFEEEWTHVEEYLVEEEEEYIHEEEEFITEEEWPVI PVKVPEVPRKPVPEEKKPVPVP 6633 



Qy 



66 SAAAVP PAAA 



APLLDFSSDSVPPA PRGPLP 



AAP 98 



Db 6634 KKKEAPPAKVPEVPKKPEEKVPVLIPKKEKPPPAKVPEVPKKPVPEEKVPVPVPKKVEAP 6693 

Qy 99 PA-APE- -RQPSWERSPAAPAP- - -SLPPA-AAVLPSKL- PEDDEP PAR PPPPP 144 

II II M h III III M || lh : I I : III 

Db 6694 PAKVPEVPKKPVPEKKVPVPAPKKVEAPPAKVPEVPKKLIPEEKKPTPVPKKVEAPPPKV 6753 

Qy 145 PAGAS PLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNT 204 

I hill U II I -I h- |: 

Db 6754 PKKREPVPVPVALPQE EEVLF EEEIVPE- -EEVLPEEEE 6790 

Qy 205 VSSGQEDFPSVLLETAASLP SLSPLSTVSFKEHGYLGNLSAVSSSEGTI - -EETLNE 259 

' || : | || | | : || | | 

Db 6791 VLPEEEE VLPEEEEVLPEEEEI PPEEEEVPPEEEY VPEEEEFVPEEEVLPE 6841 

Qy 260 ASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESA ILVENTKEEVIV 313 

-I I I = = = I = II I I : : :h h 

Db 6842 VKPKVPVPAPVPEIKKKVTEKKW 1 PKKEEAPPAKVPEVPKKVEEKRI I 6890 

Qy 314 RSKDKEDLVCSAALHSPQESPVGKED RWS P E KTMD I FN EM 354 

l-l I hi h = h hi : - : 

Db 6891 LPKEEEVLPVEVT-EEPEEEPISEEEIPEEPPSIEEVEEVAPPRVPEVIKKAVPEAPTPV 6949 

Qy 3 55 QMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESK 3 98 

■ I lh- 11 = 11 = :: : | 

Db 6950 P KKVEAP PAKVS KKI PEE KVP VP VQKKEA P PAKVPE VPKKVP EKKVLVPKKEAVP PA 7006 

Qy 3 99 VDRKCLEDS LEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCA 444 

I lh || : : : | |: | | | 

Db 7007 KGRTVLEEKVS VAFRQEVWKERLELEWEAEVEE I PEEEEFHEVEE YFEEG 7058 

Qy 445 SFTSATESTTANTFPLLEDHTSEN KTDEKKIEER KAQI ITEKTSPKT 491 

I I : hi I : H - h I I'll I 

Db 7059 EFHEVEEFIKLEQHRVEEEHRVEKVHRVIEVFEAEEVEVFEKPKAPPKGPEISEKIIPPK 7118 

Qy 4 92 SNP FL VAVQDS EADY VTTDTLS KVTEAAV SNMPEGLTPDLV 532 

II = I III : =|| I I I 

Db 7119 KPPTKWPRKEPPAKVPEVPKKIWEEKVRVPEEPRVPPTKVPEVLPPKEWPEKKVPVP 7178 

Qy 533 QEACESELNEATGTKIAYETKVDL VQTSEAI QESLYP 569 

II - II - I II : : : I- I 

Db 7179 PAKKPEAPPPKVPEAP-KEWPEKKVPVPPPKKPEVPPTKVPEVPKAAVPEKKVPEAIPP 7237 

Qy 570 TAQLCPS - - FEEAEATPS PV-LPDIVMEAPLNSLLPSAGASWQPSVSPLE 617 

■ I III I HI II -I-- I :hl H |:| = : 

Db 7238 KPESPPPEVFEEPEESPSAPPKKPEVPPVRVPEVPKEVVPEKKVPAAPPK- -KPEVTPVK 72 95 

Qy 618 AP PPVSY DSIKLEPENPPP- -YEE 639 

I M :U :|hlll =11 

Db 72 96 VPEAPKEWPEKKVPVPPPKKPEVPPTKVPEVPKVAVPEKKVPEAIPPKPESPPPEVFEE 7355 

Qy 640 AMNVALKALGTKEGIKEPESFNAA VQETEAPYISIACDLIKETKLSTEP 688 

Mh I -III M I I II : h :| 

Db 7356 PEEVALEE-PPAEWEEPEP- -AAPPQVTVPPKNPVPEKKAPAV VAKKPELPPVK 7407 

Qy 689 SPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESL 748 

h h II II II lh I - I i || : 

Db 7408 VPEVP- -KEWP-EKKVP LWPKKPEAPPA KVPEVP KEW 7444 



Qy 74 9 TEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEE 8 08 

I. I ^ I ^ h II Ihh : I I : hh : I 

Db 744 5 PEKKVAVPKKPEVPPAKVPEVPKKPVLEE-KPAVPVPERAESPPPEVYEEPEEIAPEEE- 75 02 

Qy 809 FNTAI YSNDDLLSSKEDK IKESE TFSDSSPIEI IDEFPTFVSAKDDSPKLA 859 

- :|:| : I i I :|| I I I -I 

Db 7503 IAPEEEKPVPVAEEEEPEVPPPAVPEEPKKIIPEKKVPVIKKPEAPP-P 7550 

Qy 860 KEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFK-NIYPKDEVHVSDEFSENRSSVSK 918 

II =1 =1 == I I = h II = | II I 

Db 7551 KEPEPEKVI EKPKLKPRPPPPPPAPPKEDVKEKI FQLKAI PKKKV PENPQVPEK 7604 

Qy 919 ASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLP- -SDTEKEDRSLSAVL 967 

-III hi II ||= I =|| 

Db 7605 VELTPLKVPGGE KKVRKLLPERKPEPKEEWLKSVL 7640 



.RESULT 8 
T26213 

hypothetical protein W06A7.3b - Caenorhabdit is elegans 
C; Species: Caenorhabdit is elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T2 6213 
R;Ains cough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A /Access ion: T26213 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-222 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB01523 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3b 
A; Experimental source: clone W06A7 
C; Genetics: 

A; Gene: CESP : W06A7 . 3b 

A ; Map position: 5 

A;IntronS: 27/1; 77/2; 201/2 



Query Match 5.5%; Score 32 0; DB 2; Length 222; 

Best Local Similarity 32.1%; Pred. No. le-06; 

Matches 59; Conservative 47; Mismatches 78; Indels 0; Gaps 0; 



Qy 


976 


WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLSVTI SFRI YKGVI QAI 


1035 


Db 


27 


-l-llll Ih U H H 1 " = = : = f 1 1 1 |h = f 1 1 
I LDVI YWRDAKKSAI VLSLALL VLFVLAKYPLLTWTYSLLLALGAAAGFRVFKKVEAQI 


86 


Qy 


1036 


QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVL 
:|:| Ml | :: : :| | :: : | :|::| |: = = |:|| =: 


1095 


Db 


87 


KKTDSEHPFSE I LAQDLTLPQEKVHAQAD VF VEHATCI ANKLKKLVFVESPLES I KFGLV 


146 


Qy . 


1096 


MWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI 


1155 


Db 


147 


= 1 Ih ^ hi II II h HIH HI H II = i : :h 1 h 
LWSLTY I ASWFSGFTLAI LGLLGVFS VPKVYESNQEAI DPHLAT I SGHLKNVQN I I DEKL 


206 



Qy 1156 PGLK 1159 

I h 

Db 207 PFLR 210 



RESULT 9 
A47283 

calphotin - fruit fly (Drosophila melanogaster) 
C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 25-Apr-1997 #text_change 21-Jul-2000 
C; Accession: A4 7283 

R;Ballinger, D.G.; Xue, N. ; Harshman, K.D. 

Proc. Natl. Acad. Sci . U.S.A. 90, 1536-1540, 1993 

A; Title: A Drosophila photoreceptor cell -specif ic protein, calphotin, binds 
calcium and contains a leucine zipper. 

A; Reference number: A47283; MUID : 93165730 ; PMID:8434015 

A; Contents : photoreceptor cells 

A; Access ion: A4 7283 

A; Status: preliminary 

A;Molecule type: nucleic acid 

A;Residues: 1-873 <BAL> 

A; Cross-references : GB:L05080; NID:gl57071; PIDN: AAA28420 . 1 ; PID:gl57072 
A;Note: sequence extracted from NCBI backbone (NCBIN : 124 958 , NCBIP : 124959) 
C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl -terminal 
homology; von Willebrand factor type C repeat homology 

Query Match 5.2%; Score 3 04.5; DB 2; Length 873; 

Best Local Similarity 21.9%; Pred. No. 3.1e-05; 

Matches 213; Conservative 126; Mismatches 362; Indels 273; Gaps 42 

Qy 62 AAGLSAAAVPPAAAAPLLDFSSDSVPPAPR- -GPLPAAP PAAP ERQPSWER 110 

: I :H I I Mh I U II I I I I I I || lh 

Db 11 SAPVAAPVTPSAVAAPVQWSPAAVAVAPAWAPAPAAPTAVTPVAPPPTLASVQPATVT 70 

Qy HI SPAAPAP SLPPAAAVLPSKLPEDDEPPARPPPPPP AGASPLAEPAAPP 158 

Mill II IN || III 

Db 71 VP-APAPIAAASVAPVASVAPPWAAPTPPAASPVSTPPVAVAQIPVAVSAPVAPPVAAT 12 9 

Qy 159 STPAAPKRRGSGSVDETLFALP- -AASEP VI PSSAEKIMDLMEQPGNT VSSGQ 209 

II II I = I I III I I I : I I II h: 

Db 13 0 PTPVAPI PVAAPVIATPPVAASAPTPAAVTPWSPVIATPPWPANTTVPVAAPV 184 

Qy 210 EDFPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERA 268 

Db 18 5 AAVPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA--TIPECVAPLIPEVSWA 242 

Qy 269 TNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALH 328 

M I hi I I : I 

Db 243 TKPLA AAEPWVAPPATETPWAPAAA 269 

Qy 329 SPQESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDV 388 

II I hi - : I I I I I 

Db 270 SPHVS VAP AVETAWAPV 2 87 

Qy 38 9 LAARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYI TCA 444 

h I I I : I :| : I : | || | :: 

Db 288 SASTEPPVAAATLTTAPETPALAPWAESQVAANTWATPPTPAPEPETIAPPWAE 344 



Qy 



445 SFTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEA 504 



Db 


345 


TPEVASVAVAETTPPWPPVAAES - I PAPWATTPVPATLAVTDPD- 


389 


Qy 


505 


DYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQ 564 
! 1 . 1 1 . ! . 1 1 1 i i i i i 


Db 


390 


f i • i i • 1 - I 1 1 1 1 1 II 

VTASAVPELPPVIAPSPVPSA - VAETPVDLA P 


420 


Qy 


565 


ESLYP-TAQLCPSF- -EEAEATPSPVLPDI VMEAPLNS LLP SAGAS WQPSVS PLEA P- - 
1 1 1= h II Ihl : : 1 h =| : || :| ||| 


619 


Db 


421 


PVLPPVAAEPVPAWAEETPETPAPASAPVTI -AALD- - I PEVAPVIAAPSDAPAEAPSA 


477 


Qy 


620 


--PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD 677 


Db 


478 


III II | : | | | || : : : 
AAP IVS TPPT TAS VPETTAP PAAVPTE 


504 


Qy 


678 


LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE- -LVEDSSPESEPVDLFSD-DS 


729 


Db 


505 


= = 1 : 1 : 1 = hl ■■■■WW -1 1=11 I 

PIDVSVLSEAAI ETPVAPPVEVTTEVAVADVAPPEAAEDLI I EPVEPPAPI PDLLEQTTS 


564 


Qy 


730 


I PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAA 78 9 
-II i 1 .III ... 1 .1 l .1.1. 


Db 


565 


* " i * * * • i i -ill • • • 1 -1 1 : 1 : 1 : : 
VPAVEAAESTSSPIPETSLPPPNEAVA SPEVAVAPITAPEPI PEP-EPSLATPTEPI 


620 


Qy 


790 


SNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPIEI IDEFPTFV 


849 


Db 


621 


: 1 - 1 =: 1 : =| 1 III 1 

PVEAPV VIQEAVDAVEVPVTETSTSI PETTVEFPEAV 


657 


Qy 


■ 850 


SAKDDSPKLAKEYTDLEV- SDKSE I ANI QSGAD SLPCLELPCDLSFKNI YPKDEVH 


904 


Db 


658 


: 1 1 = h 1 = = -III II : 1 :h | 
AEKVLDPAI TEAP VTTQE PDVANI NDGAPATE I TTPAVE I VT 


699 


Qy 


905 


VSDEFSENRSSVSKASISPSNVSALE-PQTEM- -GSI VKSKSLTKEAEKKLPSDTEKEDR 


961 


Db 


700 


AAAEVSDTAI PLI DPPV- PQEIAVAEI PETETKPAEVI VEQS - T I PI EAPVPEVSKYAEP 


757 


Qy 


962 


SLSAVLSAELSKTS 975 




Db 


758 


= 1 = 11 = h 
VISEAPAAEVPITA 771 





RESULT 10 
T24583 

hypothetical protein T06D8 . 1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T24 583 
R; Palmer, S. 

submitted to the EMBL Data Library, April 1995 
A; Reference number: Z19909 
A ,-Access ion: T24583 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A ,-Molecule type: DNA 
A;Residues: 1-1829 <WIL> 

A; Cross -references: EMBL:Z49130; PIDN : CAA88964 . 1 ; GSPDB : GN00020 ; CESP:T06D8.1 

A; Experimental source: clone T06D8 

C; Genetics: 

A;Gene: CESP : T06D8 . 1 

A; Map position: 2 



A;Introns: 1391/3; 1432/3; 1470/3; 1505/1; 1520/1; 1616/1; 1644/1; 1687/3; 
1742/1 

Query Match 5.2%; Score 302.5; DB 2; Length 182 9; 

Best Local Similarity 21.1%; Pred. No. 0.00011; 

Matches 233; Conservative 157; Mismatches 469; Indels 245; Gaps 38; 

Qy 2 EDIDQSSLVSSSTDSPPR PPPAFKYQFVTEPEDEE DEEEE E 42 

1= f I I III II : II I 

Db 212 EETTWAWESSGEEPASSSTSIPTELSKNDQVTEASGEETITAAATEASEETTTSAVTE 271 

Qy 43 DEEEDDEDLEELEVXjERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRG---PLPAAPP 99 

II ^ :h Hh h -I = I I I || 

Db 272 GSGEDTTWAWELSGEQPAS- - SSTSI P TELSKDDQVTEASGEETTTAAATE 322 1 

Qy 100 AAPERQPS WERSPAAPAPSLPPAAAVLPSKLPEDDE- PPARPPPP 143 

h I I Mill: H-l :|h I 

Db 323 ASEETTTSAVTEGSGEETTWAWESSGEEPASS STSI PTELSKDDQVTEASGEET 378 

Qy 144 PPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFAL PAASEPVI PSSAEKIMD 196 

I h : I I : Ml | h h ||:| ||: | 

Db 379 TTAAATEASE ETTTSAVTEGSGE-DTTWAWESSGEQ PAS SSTSI PTELSKDDQ 432 

Qy 197 LMEQPGN TVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEH 237 

= 11 I lh: -I- h | | : 

Db 433 VTEASGEETTTAAATEASEETTTSAVTEGSGEDTTWAWESSGEQPASSSTSIPT 488 

Qy 238 GYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPK 297 

I h : I III hill | | : : : : || : 

Db 48 9 -ELSKDDQVTEASG--EETTTAAATEASEETTTSAVTEGSGEETTV-VAWESSGEEPAS 544 

Qy 298 GESAI LVENTKEEVI VRSKDKEDLVCSAALHS PQES PV GKEDR WS P E KTMD I 350 

-I I =l = : ■ ■ 'I H I =|= hi I h = = I 

Db 54 5 SSTSIPTELSKDDKVTEASGEETTTAAATDASSEETTTSAVTEGSGEETTWAWESSD- 603 

Qy 351 FNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410 

II II | Ml 

Db 604 -EEPASSSTSIPTELSKDDQVTEASGEETTT AAATEASEETTTSAVTEGSGEE 655 

Qy 411 KSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKT 470 

= = I I- II hi : I I I h||| : h : 

Db 656 TTWAWESSGEEPASSSTSI PTELSKDDKVTEA SGEETTTAAATDASSEETTTSAV 712 

Qy 471 DEKKIEERKAQI ITEKTSPK-TSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTP 52 9 

Db 713 TEGSGEETTWAWESSDEEPASSSTSIPTELSKDDQWEASGEETTTAAATEASEETTT 772 

Qy 53 0 DLVQEACES ELN ----- EATG TKI AYETKVDLVQTSEAI QESLYPTAQLCPSF 577 

I I I hH I I I I hi I | 
Db 773 SAVTEGSGEETTWAWESSGEEPASSSTSIPTELSKD-DQVTEASGEE TTTAAATE 828 

Qy 578 EEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPY 637 

I I I I : : ::: | : | ||: | | :| 
Db 82 9 ASEETTTSAVTEGSGEDTTWAWESSGE QPASSSTSI PTELS 871 

Qy 63 8 EEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSE 6 97 



Db 


872 


KDDOVTEASGEETTT AAATFA^FFTTTQAVTFfi^nFFTTV 


J? -L -L 


Qy 


698 


IAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEV PQTQEEAVMLMKESLTEVSET 


754 


Db 


912 


: 1 II II II 1 1 :| II = 1 | | | : 

VAWESSGEEPA SSSTS I PTELSKDDOVTEA^nEFTTTA A ATFA QQFFTTTCJAVT 


2? D O 


Qy 


755 


VAQHKEERLSASPQELGKPYLESFQP NLH^TKDAA^N 


791 


Db 


967 


= 1 M : h 1 1 : |:::::| 

EGSGEETTTSAVTEGSGEETTTSAVPEGENSTTEAPAFVTnc;PTPTPc;qppqccTTTHnP 




Qy 


792 


DIPTLTKKEKISLOMEEFNTAIYSNDDLLSSKEDKIKESET-F^DciqpTFTTnFFPTFVc: 


OCA 

ODD 


Db 


1027 


II =1 1 H H : 1 : :::: | | :| | : | | | ||: 

SIPVITPKPSVSSTIENVMSKTSSEE AAEKKI IGEHQTGKDDDAGKEDEDNMPAFVT 


1 AQO 
lUO J 


Qy 


851 


* * L\-i-J l-J IT 1\ 1 irilVLj 1 X -L/XJJZj V OJ_JJ\-iZJL^j X.MXN] J iM 1 J/~l 1 iKt | )r.l iPl | J 

1 - = : hill =| ::=| 


QQ1 


Db 


1084 


ANPAGT^TTE^AFMVT^THFFnFKIT KMAKFT.HK'nFAanT hVl A 


1126 


Qy 


892 


LSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKK 


951 


Db 


1127 


II h- 1 : 1 1 ■ Ih 1 : ■ 1 1 : - || - 

AKDGVNLT-ETADAKDSGETAHVEDEQVSSTE- -SSIGSEETTTTVNKETTEE 


1176 


Qy 


952 


LPSDTEKEDRSLSAVLSAELSKTS 975 




Db 


1177 


HHEASGEEDDAPAF VTGAPTDSTT 12 0 0 





RESULT 11 
A47282 

calcium-binding protein calphotin - fruit fly (Drosophila melanogaster) 
C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 21-Jul-2000 
C; Access ion: A4 7282 

R;Martin, J.H.; Benzer, S.; Rudnicka, M. ; Miller, C.A. 
Proc. Natl. Acad. Sci . U.S.A. 90, 1531-1535, 1993 

A;Title: Calphotin: a Drosophila photoreceptor cell calcium-binding protein. 

A; Reference number: A47282; MUID : 93 165729 ; PMID:8094559 

A;Accession: A47282 

A; Status: preliminary 

A;Molecule type: nucleic acid 

A;Residues: 1-865 <MAR> 

A; Cross-references : GB:L02111 ; NID:gl57031; PIDN : AAA284 05 . 1 ; PID:gl57032 
A; Experimental source: photoreceptor cells 

A;Note: sequence extracted from NCBI backbone "(NCBIN: 124955, NCBI P : 124956) 

C; Genetics : 

A; Gene: FlyBase: Cpn 

A; Cross-references : FlyBase : FBgn0010218 

C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl -terminal 
homology; von Willebrand factor type C repeat homology 
C; Keywords: calcium binding 

Query Match 5.1%; Score 2 95.5; DB 2; Length 8 65; 

Best Local Similarity 21.3%; Pred. No. 7.1e-05; 

Matches 217; Conservative 127; Mismatches 379; Indels 295; Gaps 41; 
Qy 62 AAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPER QPSWERSPAAPAP 117 



Db 


11 


SAPVAAPVTPSAVAAPVQWSPAAVAPAPAA.PIAVTPVAPPPTLASVQPATVTIP-APAP 


69 


Qy 


118 


SLPPAAAVLPSKLPEDDEPPARPPPPPPAGA SPLAEPAAPPSTP 


161 


Db 


70 


1= 1 hi 1 = III III :hl I 1 || 

IAAASVTPVASVAPPWAAPTPPAASPVSTPVAVAQI PVAVSAPVAPPVAATPTPWQI P 


19 9 


Qy 


162 


-AAPKRRGSGSVDETLFALP--AASEP VI PSSAEKIMDLMEQPGNT VSSGQED 

Ml II 1 1 : | | || |:: 


911 
z i i 


Db 


130 


VAAP VI ATP PVAASAPTPAAVTP VI S P VI A ^ PP WPANTTVP VA A P VA A 


1 70 

± 1 0 


Qy 


212 


FPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATN 


270 


Db 


179 


VPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEI PVA- -TI PECVAPLI PEVSWATK 


236 


Qy 


271 


PFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSP 


~D J U 


Db 


237 


1 1 hi 1 1 : 1 II 


9£7 


Qy 


331 


QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLA 


O ZJ \J 


Db 


264 


HVS VAP AVETAWAPV 


279 


Qy 


391 


ARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCASF 


446 


Db 


280 


h 1 1 1 : 1 :| : | : | || | : = 

-SASTEPPVAAATLTTAPETPALAPWAESQVAANTWATPPTPAPEPETIAPPWAETP 


338 


Qy 


447 


TSATESTTANTFPLLEDHTSENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADY 506 
1 . - 1 1 - - . 1 . I 11.11! 


Db 


339 


i • • 1 i • • l 1 1 1 • II 1 : 
EVASVAVAETTPPWPPVAAES 1 PAPWATTPVPATLAVTDPD 


381 


Qy 


507 


VTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES 


566 


Db 


382 


II I II III 

VTASAVPELPPVIAPSPVPSA VAETPVDLA PPV 


414 


Qy 


567 


LYP-TAQLCPSF- -EEA EAT PSPVLPDI VMEAPLNS LLP SAGAS WQPSVS PLEA P 


619 


Db 


415 


M h h II III : : I h =| : II :| III 

LPPVAAEPVPAWAEETPETPAPASAPVTI -AALD- - I PEVAPVIAAPSDAPAEAPSAAA 


471 


Qy 


620 


PPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACD- - 
1 M M hi II II - : 


677 


Db 


472 


PIVS TPPT TASVPETTAPPAAVPTEPI 


498 


Qy 


678 


LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE- -LVEDSSPESEPVDLFSD-DSI P 

. . 1 .1 .1 .i.f ..mi i i ii i i 


731 


Db 


499 


' " 1 'I -1 ■ 1 - 1 • : III : : 1 1 : || | : | 
DVSVLSEAAI ETP VAP P VEVTTE VAVADVAP P EAAADL HE PVE P PAP I PDLLEQTTS VP 558 


Qy 


732 


EVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASN 
1 = = = : II =1 II - =| =| | :|:| : : 


791 


Db 


559 


AVEAAESTSS PI PETSLPPPNEAVA SPEVAVAP I TAPEPI PEP - EPSLATPTEP I PV 


614 


Qy 


792 


D I PTLTKKEKI SLQMEEFNTAI YSNDDLLSS KEDKI KES ETFSDSS P I E 1 1 DEFPTFVSA 


851 


Db 


615 


* 1 1 == 1 = =1 1 III h 
EAPV VI QEAVDAVEVPVTETSTS I PETTVEFPEAVAE 


651 


Qy 


852 


KDDSPKLAKEYTDLEV-SDKSEIANIQSGAD SLPCLEL PCDLSFKNI YP- 


899 


Db 


652 


KVLDPAI TEAPVTTQEPDVANI NDGAPATEI TTPAVE I VTAAAEVSDI AI PVI DPP 


707 



Qy 900 -KDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMG 936 

|: |: | | : :: : | : | |: 

Db 708 VPQE I AVA -EI PETDTKPAE VI VEQSTI P I EAP VPEVSKYAEP VI S EAPAAEVP I TAGDN 766 

Qy 937 SIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGV 990 

h h - III : I h I = : II II- I I 

Db 767 PDNTSVGISEWPTIAEKPVEEVPTSEIPEQSSSPSDSVPVAKITPLL- -RDLQTTDV 822 



RESULT 12 
T34513 

hypothetical protein ZK783.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence__revision 29-Oct-1999 #text_change 29-Oct-1999 
C; Accession: T34 513 

R;Favello, A. ; Vaudin, M . ' 

submitted to the EMBL Data Library, August 1994 

A; Description: The sequence of C. elegans cosmid ZK783 . 

A;, Reference number: Z21536 

A; Accession: T34513 

A;Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-3507 <FAV> 

A; Cross-references : EMBL:U13646; PIDN: AAC24418 . 1 ; GSPDB : GN00021 ; CESP: ZK783 . 1 

A; Experimental source: strain Bristol N2 ; clone ZK783 

C; Genetics : 

A; Gene: CESP:ZK783 .1 

A; Map position: 3 

A;Introns: 14/1; 48/2; 84/1; 196/3; 303/1; 381/1; 586/1; 605/1; 1175/3; 1207/1; 
1409/2; 1450/1; 1760/1; 1898/1; 2153/1; 2628/1; 2681/1; 2899/1; 2946/1; 3088/3; 
3184/1; 3283/1; 3346/3; 3365/3; 3484/3; 3504/1 

Query Match 5.0%; Score 2 92; DB 2; Length 3 507; 

Best Local Similarity 20.6%; Pred. No. 0.00071; 

Matches 226; Conservative 146; Mismatches 491; Indels 236; Gaps 37; 

Qy 11 SSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDL- - EELEVLERKPAAGLSAA 68 

llh-l II : :|| : HI I I : II I : I 

Db 2043 SSSSEAPLTSSPATTTEVITESSVKSTTPKEESSSEITVKLSSKSPEVTESSVKSSPSTP 2102 

QY 69 AVPPAAAAPLLDFSSDSVPPAPRGPLPAAPP AAPERQPSWERSPA APAPSL 119 

: :| | : |: : | :|:|| | : || 

Db 2103 STTSQSVTSTVPETSKSTVLSSEAPVTSTSPTEVHTSSETKPSLSASSTTGDTNSTTPST 2162 

Qy 12 0 PPAAAVLPSKLPEDDEPPARPPPPPPAGASP-LAEP AAPPSTPAAPKRRGSGSV 172 

hi = II I I = II -:| I II I : II 

Db 2163 SSLASVKSTSAPEGTS - -ASVAPVKLSSLSPDVSQPSTKTFDATESSTVQASETSSGTSV 222 0 

Qy 173 DET LFALPAASEPVI -PSSAEKIMDLMEQPGNTVSSGQEDFP SV 215 

I = h = l II II = = III =| III I I 

Db 2221 KSTSEPESHVTKLSITSSNPSSSVPVTSPKSTPTVPESTEQPTSTTPSGQSLTPMNSNSE 2280 

Qy 216 LLETA ASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTI EETLNEASKELPERAT- -N 270 

: I 1= I I I I = I I I I = : = I HI I I = 

Db 2281 VLTTSEPHVLSSSLSPDVSQSSTTPNNLSESSTVETPKTSSEVSLNSEEPSTTEAPTTLS 2340 



Qy 271 PFV NRDLAEFSEL EYSEMGSSFKGSPKG ESAILVENTKEEVI VRSKD 317 

I ^ :h= I : I I - I I I I - :| I : 

Db 2341 PDILSTTTNNLSQSSTVSTEDRSEISSENSEKPTSAPELVTSSVTHVASSSPDVPTESSE 2400 

Qy 318 KEDLVCSAALHSPQ ESPVGKEDRWSPEKTMDI FNEMQMSWAPVREEYADFK 370 

■W 1 = ■ h I : III :| : I I : 

Db 24 01 PDDLTGSSTENI PEASSKQTI SSTPTPDTTTASEEPTKSTSMSPDLSTTSNVLSESSTTP 2460 

Qy. 371 PFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTP 430 

I : II - =111 =11 II : :|| 

Db 2461 ESSSKSPVSSSTEG ISWTSTEFSKVPESTISSVLE EDLT-KTTP 2504 

QY 431 E PVKDS SRAY I TCAS FTSAT ESTTANTFPLLEDHTSENKTDEKKIEE 477 

h : : I II II | ||:: | :: :: | : 

Db 2505 SPILEET TTASETSEPLTEDSLTVSVRIHELTTSSENVPKESESTTTSSESSKPSQ 2560 

Qy 4 78 RKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGL TPDLV 532 

i hi Ih h : II :|-| | -hi :| | 

Db 2561 EPAGI LTSTVWPTSS VSLITASEI EA- - 1 TSNTPFKQGRTP I TTS PKSLVKSTTS PSTV 2618 

Qy 533 - -QEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPD 590 

III I == I : |||:= : h h I- :| I 

Db 2619 TSSEPSESTKRTTVSTTVSTTTPTEETTTSESLILTAAPSK PTESTTESSEAPTTP - 2674 

Qy 591 IVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGT 650 

I I HI ||: : : 

Db 2675 AKTSETKPS NVSSTSRKS 2692 

Qy 651 KEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE 710 

I : : | : : : : : | || |= : : | | | 

Db 2693 TENVETSTSQSGSLESS TMSSTSSEPETNAPAVTVSSEASSTTLEE 2738 

Qy 711 LVEDSS PESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVAQHKE - ERLSAS PQE 769 

III I = II III : : I- = hi :|: | | 

Db 2739 NSSTSSPTSSEASVKLSSLFPE- -SITSEAVTVSSRAPAEITMSSESHREISTVSSEPSE 2796 

QY 77 0 LGKPYLESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSS 822 

I : Ih : hi!: | I : |||: | 

Db 2797 PEI PLSTTVS PNWTASS I PSEE - P I LSSVTSSSTPRVRLI TG - - TPDDLI VSVTVPSHG 2853 

QY 823 - KEDKI KESETFSDS - SP I EI I DEFPTFVSAKDDS PKLAKEYTDLEVSDKSEI ANI QSGA 880 

= I I 1 = 1 III : I I : II I || 

Db 2854 NRRQNITASSVPSNSTSPI ILPSESLTTPQPPPTTTTTAKPAT TSGK 2900 

QY 881 DSLPCLELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSI VK 940 
Db 2901 RGPPSIQPPAEM FTTPAPPPPSNGGYGEE 2929 

QY 941 SKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLL 1000 

I ^ h- Mill::: I : III h 

Db 2930 TNQEEEQVTSTTTTEAPSLCSTVTCHSLATCE QSTGVCICRDGFIGD 2976 

Qy 1001 SLTVFS I VS VTAY I ALALL 1019 

III hi I 

Db 2977 GTTACSKKSTADCISLPSL 2995 



RESULT 13 
A56577 

microtubule-associated protein MAP IB - rat (fragment) 
C;Species: Rattus norvegicus (Norway rat) 

C;Date: 21-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 16-Feb-1997 
C;Accession: A56577 

R;Zauner, w. ; Kratz, J. ; Staunton, J. ; Feick, P.; Wiche, G. 
Eur. J . Cell Biol. 57, 66-74, 1992 

A;Title: Identification of two distinct microtubule binding domains on 
recombinant rat MAP IB. 

A;Reference number: A56577; MUID : 92347374 ; PMID: 1639092 

A; Access ion: A56577 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-2364 <ZAU> 

A; Cross -references : GB:X60550 

A; Experimental source: brain 

A;Note: nucleotide sequence not given; conceptual translation not complete 
C;Superfamily: microtubule-associated protein MAP1B 

Query Match 5.0%; Score 291.5; DB 2; Length 2364; 

Best Local Similarity 20.0%; Pred. No. 0.00043; 

Matches 220; Conservative 167; Mismatches 456; Indels 255; Gaps 44; 
Qy 30 TEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPA 8 9 
Db 913 SEEEGEEEEDKAEDAREEDHEPDKTE AEDYVMAWDKAAEAGVTEDQYDFL 963 

Qy 90 PRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGAS 14 9 

I =1 I :-|| lh:: II II II 
Db 964 GTPAKQ PGVQSPSREPASSIHDETLPGGSESEAT AS 999 

QY 150 PLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGN TVS 206 

II II I : MM hi IN 

Db 1000 DEENREDQPEEFTAT SGYTQST 1 EI SSEPTPMDEMSTPRDVMTDETNNEETES 1052 

QY 207 SGQE DFPSVLLETAASLP SLSPLS TVSFKEHGYLGNLSAVSSSEGTI E 254 

M : I | I I I : || | : | = | =| | 

Db 1053 PSQEFVNITKYESSLYSQEYSKPWASFNGLSDGSKTDATDGRDYNASASTISPPSSMEE 1112 

Qy 255 ETLNEAS KELPERATNPFVNRDLAEF- -SELEYSEMG 28 9 

Db 1113 DKFSKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERS 1172 

Qy 290 SSFKGSPKGESAILVENTKEEVIVR SKDKEDL VCSAALHSP 330 

= I I = lh :| : =| I ::: | | :| |:| 

Db 1173 VNFSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTP 1232 

Qy 331 - QES PVGKEDRWS PEKTMDI FNEMQMS WAPVREE YADFK PFEQAWEVKD 380 

= 11 == : I I : : II h : I I - I I 

Db 1233 YYQS PTDEKSSHLPTEVT ENAQAVPVS FEFTEAKDENERSS I S PMDE - - PVPD 1283 

Qy 38*1 TYEGSRDVLAARANVESKVDRKCLED - - SLEQKSLGKDS EGRNEDASFPSTPEPVK 434 

lh - MM hlh I Ihl I II 

Db 1284 SESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSDKESPVS 1343 



Qy 



435 DSSRAYITCASFTSATESTTANTFPLLEDHTSENKT 



DEKKI 475 



Db 


1344 


D LTSDLYQDKQEEKRAGFI P I KEDFS PEKKASDAE I MSSQSALALDERKLGGDGS 


1398 


Qy 


476 


EERKAQ 1 1 TEKTS PKTSNPFLVAVQDS EADYVTTDTLSKVTEAAV- - 

hi 1 1 1 : : 1 * 1 • 1 ■ - * 1 ■ 1 ■ 1 


520 


Db 


1399 


1*1 1 1 1 1 * 1 • | ■■'1*1-1 
PTQVDVSQFGSFKEDTKMS I SEGTVSDKSATPVDEGAEDT YSHMEGVASVSTASVAT 


1455 


Qy 


521 


SNMPEGLTPDLVQEACESELNEATGTKI AYETKVDLVQTSEAI QES - LYPTAQLCP - - -S 
I : II III : : | : 1 : - | - 1 | | It. . -1 . . II 1 


576 


Db 


1456 


I II III i | * • | ■ | || | | - . | . - | | | 
SSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMS 


1514 


Qy 


577 


FEEAEATP -SPVLPDI VMEAPLNSLL - PSAGASWQ 

- : 1 1 ■ 1*1-11 1 * 1 1 • • 


610 


Db 


1515 


1 1 * 1 * 1 ■ ft 1 * 1 1 * • 
ISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHPTVGAGMLH 


1574 


Qy 


611 


PSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFN 

: : III | 1 : 1 II | . . 


661 


Db 


1575 


Ill 11*1 II * | 

I TEN - - -GPTEVDYS PSD I QDSSLSHKI PPTEEPS YTQDNDLS - ELI SVSQVEAS PSTSS 


1630 


Qy 


662 


AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 


721 


Db 


1631 


| | | | : : | | | | | : : | : | : | | | : | : 
AHTPS QIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKL SPKSDI 


1680 


Qy 


722 


VDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQP 


780 


Db 


1681 


| : | | : : | | | | : | : . . | | | . . | 
SPLTPRESSPTYSPGFSDSTSGAKES-TAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQH 


1739 


Qy 


781 


NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 


840 


Db 


1740 


HLALSRDLTTSS V EKDNGGKTPGDFNYAYQKPESTTES PDEEDYDYE - - SHEKTIQ 


1793 


Qy 


841 


I IDEFPTFVSAKDDSPK LAKEYTDLEVSDKSE I ANI QSGADSLPCLELPCDLSFKN 

1 • * 1 1 - 1 - 1 ■ -II 1 . . . 1 


896 


Db 


1794 


1 • • 1 1 • 1 • 1 ■ * 1 1 1 : : : 1 
AHDVGGYYYEKTERTIKSPCDSGYSYETIEKTTKTP EDGGYS- - CEITEKT 


1842 


Qy 


897 


I YPKDEVHVSDEFSENRSSVSKAS - - ISPSNVSALEPQTEMGSI VKSKSL 


944 


Db 


1843 


TRTPEEGGYSYEISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSY 


1902 


Qy 


945 


TKEAEKKLPSDTEKEDRS 962 




Db 


1903 


= 1 :h 1 1 1 1 

SYETTEKITSFPESESYS 192 0 





RESULT 14 
T19431 

hypothetical protein C25A1.10 - Caenorhabdit is elegans 
C;Species: Caenorhabdit is elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 21-Jan-2000 
C /Access ion : T19431 
R;Mortimore, B. 

submitted to the EMBL Data Library, October 1996 
A; Reference number: Z19124 
A; Accession : T19431 

A/Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A;Residues: 1-971 <WIL> 



A; Cross-references : EMBL : Z8103 8 ; PIDN : CAB02755 . 1 ; GSPDB : GN00 019 ; CESP : C25A1 . 10 

A; Experimental source: clone C25A1 

C; Genetics : 

A;Gene: CESP : C2 5A1 . 10 

A /Map position: 1 

A;Introns: 38/3; 92/3; 201/3; 919/3 

C;Superfamily: nucleolus-cytoplasm shuttle phosphoprotein 

Query Match 4.9%; Score 288.5; DB 2; Length 971; 

Best Local Similarity .20.6%; Pred. No. 0.00016; 

Matches 198; Conservative 127; Mismatches 391; Indels 247; Gaps 35; 

QY 7 SSLVSSSTDSPPRPPPAFKYQFVTEP EDEEDEEEEEDEEEDDEDLEELEVLERKPAA 63 

II I I I II: II : :| : : : : . : h || :: :: | | 
Db 14 7 SSSDSDSDDEPPKKAPAVTTKVAPKPMAKKQDTSDSDSDSEDSDDGKSKKANPVKVTPVA 2 06 

QY 64 GLSAAAVP PAAAAPLLDFSSDS VP PAPRGPLPAAP PAAP ERQPSWE 109 

I Ih III II : I II I I |::| : 

Db 207 NVLQKWAKKAASSSSDSSDDEKKPAAK-PTPAKPTPKPWKKAESSSDSSDDEKKPVAK 2 65 

QY HO RSPAAPAPSLPPAAAVXiPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGS 169 

HI III I III Ih I I hh ||:::: 

Db 266 PAPAKATPK- PAAKKADSSSDSSDDEAPAK- KTPAKAAPKPVAKKAESSSDSSDDEKK- - 321 

Qy 170 GSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPL 229 

III 1= h : : : | :| : : | : : |: 

Db 322 PAAK PTPAKATPKPVAKKAESSSDSSDDEKKPVAKPAPAKATPKPV 367 

Qy 230 STVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSE-- 287 

I Mh : U hi | :|| | = h | 

Db 368 AK KAESSSDSSDDEK-KPAAKPTPAKATPKPVAKKAESSSDSSDDEKK 414 

Qy 288 MGSSFKGS PKGESA I LVENTKEE VI VRS KDKEDLVCSAALHS PQES P VGKEDR WS P 344 

I I ■ I I I'--- I hi I :| 

Db 415 PVAKPTSAKATPK PAAKKAD SSSDSSDDEAPAKKTPAKAAP 455 

Qy 345 E KTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRD VLAARANV 395. 

Db 4 56 KPASKKAESSSDSSDDEKPAAKSTPAKITPKPTAKKVASSSSDSSDDEKKPAAKPTPANA 515 

QY 3 96 ESKVDRKCLEDSL EQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSAT 450 

I I I I hi : I : : :|h| 
Db 516 TPKPVAKKAESSSDSSDDEKKPVAKPTSAK AT P KPAAKKADLS S DFS DDEAPA 568 

Qy 451 ESTTANTFPLLEDHTSENKTDEKKI EERKAQI ITEKTSPK-TSNPFLVAVQDSEADYVTT 509 

: I I I : h : I h h Ihll h : II I 

Db 56 9 KKTPAKAAPKPASKKAESSSDSSDDEKPAAKSTPAKTTPKPTAKKAASSSSDSSDD- -EK 62 6 

Qy 510 DTLSKVTEAAVSNMPEGLTPDLVQEACESELN EATGTKIAYETKVDLVQTSEAI 563 

-III : I I : : : | " : | | | : | | | : | : : 

Db 627 KPVAKPTSAKATPKPAAKKADSSSDSSDDEAPAKKTPVKPTPVKIVAK-KVD--SSSDSS 683 

Qy 564 QESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVS 623 

= M =111 M h I II I 
Db 684 DDEKKPT KATPVKVTP KSVTKKAAAS S 710 

Qy 624 YDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETK 683 



I I 

EKK 748 



Db 


711 


SDS - - SDDEKKP WKQTPNW PKKEKAASSSDDSS - DDEKK 


748 


Qy 


684 


LSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVML 
: - 1 : 1 1 • 1 • ' Ml ' 1 1 1 1 1 


743 


Db 


749 


I I 1 ' 1 * I 1 1 ■ 1 1 II 1 

PTAKPTP KATPKQSAKKADSSDDS - - SDDEAPA 


779 


Qy 


744 


MKESLTEVSETVAQHKEERLSAS PQELGKPYLESFQPNLHSTKDAASNDI PTLTKKEKI S 

II 1 III 1 


803 


Db 


780 


II 1 III 1 


799 


Qy 


804 


LQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI EI I DEFPTFVSAKDDS - - PKLAKE 
hll .. I 1 . 1 . 1.1 - - - 1 . | 1 1 1 - 1 


861 


Db 


800 


1 ■ 1 1 " 1 1 • 1 ■ l-l * * * 1 - II 11*1 
SSDDSSDDEKTKKKSATTPAKSTPKTALKKAESSDSSDDDEDLPKPSKA 


848 


Qy 


862 


YTDL-EVSDKSEIANIQSGADSLPCLEL-PCDLSFKNI YPKDEVHVSDEFSENRSSVSKA 


919 


Db 


849 


1 -III:: 1 h I I : :: | | | ::| 

VTPRPQRADSEESAETEESSSRTPALKAKPLATSTEKAVYENRKRKSSPF- -RRVQMTKD 


906 


Qy 


920 


SIS 922 




Db 


907 


hi 

SVS 909 





RESULT 15 
QRMSP1 

microtubule-associated protein MAP1B - mouse 

N;Alternate names: microtubule-associated protein MAPI (X) ; microtubule- 
associated protein MAPI. 2; microtubule-associated protein MAPS 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 01-Sep-2000 
C;Accession: S07549; S44387; A33645 
R;Noble, M. ; Lewis, S.A.; Cowan, N.J. 
J. Cell Biol. 109, 3367-3376, 1989 

A; Title: The microtubule binding domain of microtubule-associated protein MAP1B 
contains a repeated sequence motif unrelated to that of MAP2 and tau. 
A;Reference number: A33645; MUID: 90094539 ; PMID:2480963 
A;Accession: S07549 
A; Molecule type: mRNA 
A/Residues : 1-2464 <NOB> 

A; Cross-references: EMBL : X513 96 ; NID:g52999; PIDN : CAA35761 . 1 ; PID:g53000 
R;Sanchez, C. ; Padilla, R. ; Paciucci, R. ; Zabala, J.C.; Avila, J. 
Arch. Biochem. Biophys. 310, 428-432, 1994 

A;Title: Binding of heat-shock protein 70 (hsp'7 0) to tubulin. 

A;Reference number: S44387; MUID : 94234720 ; PMID:8179328 

A; Access ion : S44387 

A; Status : preliminary 

A; Molecule type: protein 

A;Residues: 653-663,'IC <SAN> 

C;Superfamily : microtubule-associated protein MAP1B 

C; Keywords: microtubule binding; phosphoprotein; tandem repeat 

F;589-786/Domain: microtubule binding #status experimental <MTB> 

F; 589-592, 639-642,649-652,655-658,660-663,668-671,674-677, 679-682,683-686, 687- 

690,691-694,695-698,699-702,708-711,712-715,716-719,720-723,727-73 0,758-761,764- 

767, 783-786/Region: 4-residue repeats (K/R-K-E/D-X) 

F; 1861 -2064/Region : 17-residue repeats 



F; 91, 116,3 51, 88 8, 1124, 1153, 1168, 12 08, 1662, 1877, 1918 ,2 003 ,2030 ,2054, 2083/Binding 
site: phosphate (Ser) (covalent) #status predicted 

F; 147, 969, 1336, 1562, 1563, 1702, 1708, 1990, 2057, 2063, 24 19/Binding site: phosphate 
(Thr) (covalent) #status predicted 

F; 1953/Binding site: phosphate (Tyr) (covalent) #status predicted 

Query Match 4.9%; Score 284; DB 1; Length 2464; 

Best Local Similarity 20.7%; Pred . No. 0.00092; 

Matches 233; Conservative 159; Mismatches 424; Indels 312; Gaps 51; 

Qy 31 EPEDEEDEEEEEDEEED DEDLEELE VLERKPAAG LSAAAVP 71 

I I hi I llh II : I I I \ ■■■■ II I : I 

Db 1009 EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAEAGVTEEQYGYLGTSAKQ 1068 

Qy 72 PAAAAPLLDFSS DSVPPAPRGPLPAAPPAAPERQP SWERSPAAPA 116 

I =1 : H = = = 1 hill I : I 

Db 1069 PGIQSPSREPASSIHDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTIEISSEPT 1128 

Qy 117 P SLPPAAAVLPSKLPEDDEPPARP PPPPPAGASPLAEPA- - 155 

I I : h I h: I I : hi : 

Db 112 9 PMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPAVASFNGLSEGSKT 118 8 

Qy 156 APPSTPAAPKRRGSGSVDETLFALPA ASEPVIPSSAEKIMDLMEQ 200 

I II ■■ I h = l hi : I = =111 =h = 

Db 1189 DATDGKDYNASASTI SPP SSMEEDKFSKSALRDAYCSEEKELKASAE- -LDI KDV 1241 

QY 2 01 PGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEA 260 

: I I Mill I Ihl 
Db 1242 SDERLS PAKSPSLSP SPPSPIEKT 1265 

Qy 261 SKELPERATNPFVNRDLAEFS ELEYSEMGSSFKGSPKGESAILVEN- -TKEEVI VR 314 

I Ih I II h = I I = II |:: | = =11 

Db 1266 - -PLGERSVN FSLTPNEIKVSAEGEARSVSPGVTQAWEEHCASPEEKTLE 1314 

Qy 315 S KDKEDL VCSAALHS P - QES PVGKEDRWS PEKTMDI FNEMQMS WA- P VREE YADFK- - 370 

I : I hi : I I : ||: : |: : | || |::: | 

Db 1315 WSPSQSVTGSAGHTPYYQSP TDE KS SHLPTE VS ENAQA VP VS F E F S EAKDE 1366 

Qy 3 71 PFEQAWEVKDTYEGSRDVLAARAtsTVESKVDRKCLED- - SLEQKSLGKDS 417 

I - I h Ih : If I I Ih I 

Db 1367 NERASLSPMDE- -PVPDSESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGRRSESPF 1424 

Qy 418 EGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKT 470 

Ihl II II I = I : h II I II 

Db 1425 EGKNGKQGFPDRESPVSDLT STGLYQDKQEEKSTGFI PI KEDFGPEKKTSDVETMS 1480 

Qy 471 DEKKI EERKAQI ITEKTSPKTSNPFLVAVQDSEAD 505 

Ihh h | | I I : : I I : I 

Db 1481 SQSALALDERKLGGDVSPTQIDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGV--AEDT 1538 

Qy 506 YVTTDTLSKVTEAAV--SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAI 563 

I ■ ■■■ \- hi h II III : :h | = = I UN 

Db 153 9 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 1597 

Qy 564 QES -LYPTAQLCP SFEEAEATP SPVLPDI VMEAPLNSLL- - 601 

Ih : h : II I : :| | : : |:| :| 

Db 1598 QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 1657 



QY 602 PSAGASWQPSVSPLEAPPPVSYD SI KLEPENPPPYEEAMNVA 644 

h Mlh : : III | |: | | | : ::: 

Db 1658 FSRQSPDHPTL.GASVLHITEN GPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLS 1714 

QY 645 LKALGTKEGI KEPESFNAAVQETEAPYI S IACDLI KETKLSTEPSPDFSNYSEIAKFEKS 704 

' ■ : I : : I II I : = | I = | h =| 
Db 1715 -ELISVSQVEASPSTSSAHTPS QIASPLQEDTLSDWPPREMSLYASLA 1762 

Qy . 705 VPEHAELVEDSS PESEPVDLFSDDS I PEVPQTQEEAVML MKESLTEVSETVAQHKE 760 

= I h I I I = || | I I |: | :| : || | | 

Db 1763 SEKVQ- -SLEGEKLSPKSDI S PLTPRESSPLYSPGFSDSTSAAKETAAAH- - 1810 

QY 761 ERLSAS P QELGKP Y LESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFN 810 

: III HI - I :| :| :: : =|: :|| 

Db 1811 - QAS S S P P I DAATAE P YGFRS SMLFDTMQHHLALNRDLTTS S V EKDSGGKTPGDFN 1865 

QY 811 TAI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDSPK LAKEYTDLE 866 

I - | ::: : | | | | : : : | | :| 

Db 1866 YAYQKPENAAGSPDEEDYDYE - - SQEKTI RTHDWRYYYEKTERTI KS PCDSGYS YETI E 1923 

QY 867 VSDKSEIANI QSGADSLPCLELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKAS 920 

* 1 = 1 I I : : : I 1 I MM: : I 

Db 1924 KTTKTP EDGG YTCEITEKTTRTPEEGGYSYEISEKTTRTPEVSGYTYEK 1972 

Qy 921 I SPSNVSALEPQTEMGS I VKSKSLTKEAEKKLPSDTEKEDRS 962 

: = = I = I = I =1= I I I I 
Db 1973 TERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESESYS 2020 



Search completed: December 19, 2003, 15:37:23 
Job time : 34 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



December 19, 2003, 15:29:47 ; Search time 18 Seconds 

(without alignments) 
3038.448 Million cell updates/sec 

US- 09 -83 0-972 -2 
5848 

1 MEDIDQSSLVSSSTDSPPRP VKDAMAKI QAKI PGLKRKAD 1163 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



127863 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_4 1 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

Result Query 



No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


5848 


100 


. 0 


1163 


1 


RTN4_RAT 


(jyjKii 


rattus norv 


2 


4403 . 5 


75 


. 3 


1192 


1 


RTN4_HUMAN 


yynqcj 


homo sapien 


3 


917 


15 


. 7 


199 


1 


RTN4_MOUSE 


^ y yp 1 A 


mus musculu 


4 


801 


13 


. 7 


777 


1 


RTN1_RAT 


l*Jo^± 3 <± O 


rattus norv 


5 


789 . 5 


13 


. 5 


776 


1 


RTN1_HUMAN 


f~\1 £Z 1 Q Q 

\j± b / y y 


homo sapien 


6 


625 . 5 


10 


. 7 


236 


1 


RTN3_HUMAN 


w ~j d x y / 


homo sapien 


7 


625 . 5 


10 


. 7 


237 


1 


RTN3_M0USE 


^ _7trb y / 


mus musculu 


8 


509 


8 


.7 


471 


1 


RTN2_MOUSE 


U/UDZZ 


nius museum 


9 


490 


8 


. 4 


545 


1 


RTN2_HUMAN 


u / 3z y CS 


homo sapien 


10 


308.5 


5 


. 3 


5147 


1 


PCLO_HUMAN 


yyybvu 


homo sapien 


11 


300 


5 


. 1 


5120 


1 


PCLO_CHICK 


yypuj b 


gallus gall 


12 


295 . 5 


5 


. 1 


865 


1 


CPN_DR0ME 


\l vz y iu 


drosophila 


13 


291.5 


5 


. 0 


2459 


1 


MAPB RAT 


Dl C O A C 


rattus norv 


14 


288 


4 


. 9 


2468 


1 


MAPB HUMAN 


d>i con 
F4 b 0 z 1 


homo sapien 


15 


284 


4 


. 9 


2464 


1 


MAPB_MOUSE 


Jr±4 o / o 


mus musculu 


16 


282.5 


4 


. 8 


2805 


1 


MA PA_HUMAN 


p*7 Q c c; q 


homo sapien 


17 


279 


4 


. 8 


5038 


1 


PCL0_M0USE 


txjyqyx / 


mus musculu 


18 


275.5 


4 


. 7 


3924 


1 


ANK2_HUMAN 


-Lt: 0<± 


homo sapien 


19 


275 


4 


. 7 


1972 


- 1 


P531_HUMAN 




homo sapien 


20 


272 


4 


. 7 


5085 


1 


PCL0__RAT 


^yj KSb 


rattus norv 


21 


267 


4 


. 6 


1781 


1 


AK12_HUMAN 


yuz y jz 


homo sapien 


22 


265 . 5 


4 . 


. 5 


1828 


1 


MAP2_MOUSE 


P20357 


mus musculu 


23 


261.5 


4. 


.5 


1087 


1 


NFH MOUSE 


P19246 


mus musculu 


24 


259.5 


4 . 


.4 


1020 


1 


NFH_HUMAN 


P12036 


homo sapien 


25 


259.5 


4 . 


.4 


2404 


1 


SONJVIOUSE 


Q9qx47 


mus musculu 


26 


258 . 5 


4 . 


.4 


1861 


1 


MAP2__RAT 


P15146 


rattus norv 


27 


257.5 


4 . 


.4 


2715 


1 


TRX2_HUMAN 


Q9umn6 


homo sapien 


28 


256 


4 . 


4 


1723 


1 


AIM1_HUMAN 


Q9y4kl 


homo sapien 


2 9 


255.5 


4 . 


4 


1140 


1 


YM96_YEAST 


Q04893 


saccharomyc 


30 


254 .5 


4. 


4 


6632 


1 


UN8 9_CAEEL 


001761 


caenorhabdi 


31 


254 


4 . 


3 


1616 


1 


P2 0 0_MYCGE 


Q49429 


mycoplasma 


32 


254 


4 . 


3 


3381 


1 


PGCV_BOVIN 


P81282 


bos taurus 


33 


253 


4 . 


3 


3421 


1 


TEGU_HSVEB 


P28955 


equine herp 


34 


252 . 5 


4 . 


3 


1411 


1 


TCOF HUMAN 


Q13428 


homo sapien 


35 


252 


4 . 


3 


2738 


1 


PGCV_RAT 


Q9erb4 


rattus norv 


36 


250.5 


4 . 


3 


4377 


1 


AN K3 _HUMAN 


Q12955 


homo sapien 


37 


248.5 


4 . 


2 


1189 


1 


YJH6_YEAST 


P47035 


saccharomyc 


38 


248 


4 . 


2 


1827 


1 


MAP2_HUMAN 


P11137 


homo sapien 


39 


245.5 


4 . 


2 


1001 


1 


IF2_SYNY3 


P72689 


synechocyst 


40 


244 


4 . 


2 


1338 


1 


ACIN_MOUSE 


Q9j ix8 


mus musculu 


41 


241.5 


4 . 


1 


8545 


1 


ANC1_CAEEL 


Q9n4m4 


caenorhabdi 


42 


238 . 5 


4 . 


1 


831 


1 


NFH_RAT 


P16884 


rattus norv 



43 


238 .5 


4 . 


.1 


1341 


1 


ACIN_ 


_HUMAN 


Q9ukv3 


homo sapien 


44 


238 


4 . 


. 1 


1306 


1 


MSB2_ 


"yeast 


P32334 


saccharomyc 


45 


236.5 


4 . 


. 0 


2774 


1 


MAPA~ 


RAT 


P34926 


rattus norv 



ALIGNMENTS 



RESULT 1 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9 ; Q9WUF0 ; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Crania ta ; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 
RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE . 

RC STRAIN-Sprague-Dawley; TISSUE=Adipocyte ; 

RX MEDLINE=99249816; PubMed=l 023 1557 ; 

RA Morris N.J., Ross S.A., Neveu J.M., Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family."; 

RL_ Biochim. Biophys . Acta 1450:68-76(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20129258; PubMed-1 0667796 ; 

RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M. , Schnell L. , 

RA Spillmann A. A. , Christ F . , Schwab M.E.; 

RT 11 Nogo -A is a myel in-associated neurite outgrowth inhibitor and an 

RT antigen for monoclonal antibody IN-l."; 

RL Nature 403 :434-439 (2000) . 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4) . 

RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank / DDB J databases. 

RN [4] 

RP FUNCTION . 

RX MEDLINE=22033691; PubMed=12 03 7567 ; 

RA GrandPre T., Li S., Strittmatter S.M.; 

RT "Nogo- 66 receptor antagonist peptide promotes axonal regeneration. " ; 

RL Nature 417:547-551(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bel -2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



membrane of the endoplasmic reticulum through 2 putative 
transmembrane domains (By similarity) . 
-!- ALTERNATIVE PRODUCTS:. 

Event ^Alternative splicing; Named isoforms=4; 
Name- 1 ; Synonyms =Nogo -A, NI-220-250; 

IsoId=Q9JKll-l; Sequence=Displayed; 
Name=2; Synonyms =Nogo-B, Foocen-Ml; 

IsoId=Q9JKll-2 ; Sequence=VSP_005658 ; 
Name=3; Synonyms =Nogo - C , VP20; 

IsoId=Q9JKll-3; Sequence=VSP_005656, VSP_005657; 
Name=4; Synonyms =Fooc en - M2 ; 

IsoId=Q9JKll-4; Sequence=VSP_00565 9; 
-!- TISSUE SPECIFICITY; Isoforms 1, 2 and 3 are present in optic 
nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 
present in dorsal root ganglion, sciatic nerve and PC12 cells 
after longer exposure. Isoforms 2 and 3 are detected in kidney, 
cartilage, skin, lung and spleen. Isoform 3 is expressed at high 
level in skeletal muscle. In adult animals isoform 1 is expressed 
mainly in the nervous system. 
-!- SIMILARITY: Contains 1 reticulon domain. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AF051335 
EMBL; AJ242961 
EMBL; AJ242962 
EMBL ; AJ242963 
EMBL; AF132045 
EMBL; AF132046 
GO; GO: 0030176 
GO; GO: 0005635 
GO; GO: 0005515 
GO; GO: 0019987 
GO; GO:0030517 



InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; l. 



AAF01564. 
CAB71027 . 
CAB7102 8 . 
CAB71029 . 
AAD31019. 
AAD3102 0. 

C: endoplasmic reticulum membrane, intrinsic p. 
C: nuclear membrane; ISS. 
F:protein binding activity; ISS. 
P: negative regulation of anti-apoptosis ; 
P: negative regulation of axon extension; 



IDA. 



ISS. 
ISS. 



DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane. 


FT 


DOMAIN 


1 


989 


CYTOPLASMIC (Potential) . 


FT 


TRANSMEM 


990 


1010 


POTENTIAL. 


FT 


DOMAIN 


1011 


1104 


LUMENAL (Potential) . 


FT 


TRANSMEM 


1105 


1125 


POTENTIAL . 


FT 


DOMAIN 


1126 


1163 


CYTOPLASMIC (Potential) . 


FT 


DOMAIN 


976 


1163 


RETICULON. 


FT 


DOMAIN 


33 


46 


POLY-GLU. 


FT 


DOMAIN 


73 


76 


POLY -ALA. 


FT 


DOMAIN 


140 


145 


POLY -PRO. 


FT 


VARSPLIC 


1 


964 


Missing (in isoform 3) . 


FT 








/FTId=VSP_005656. 


FT 


VARSPLIC 


965 


975 


AVLSAELSKTS -> MDGQKKHWKDK (in isoform 



FT 



FT /FTId-VSP_005657 . 

FT VARSPLIC 173 975 Missing (in isoform 2). 

FT /FTId=VSP_005658 . 

FT VARSPLIC 192 975 Missing (in isoform 4) . 

FT /FTId-VSP_005659 . 

FT CONFLICT 1130 1131 MISSING (INREF. 3; AAD31020). 

SQ SEQUENCE 1163 AA; 126386 MW; 8CB894B09E94F0B6 CRC64 ; 

Query Match 100.0%; Score 5848; DB 1; Length 1163; 

Best Local Similarity 100.0%; Pred. No. 9.1e-214; 

Matches 1163; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

QY 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I M I I I I I I I I I I I I I I I | | M I I I I I I I I I I I I M I I I II I I I I I I I I I I I | | | 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVTjERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Db 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

loi I H', i i i i ; , i i i i i i i ; i i i i r i : 

Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

IIIIIIIIIIMIMIIIIIIMIIIIMIIMIIIIIIIIIIMIIIIIIMIIIIMI 

Db 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

QY 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

iiiiiiiiiiiiiiiii : i i i ii ii ii i ii i ii in 1 1 

Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

QY 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

_ IIIIIIIIIIIIIIIII I I I I I II I I I I | || | | | | | || | | | | 

Db 301 A I LVENTKEEVI VRS KDKEDLVCSAALHS PQES PVGKEDRWS PEKTMDI FNEMQMS WA 360 

Qy 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

M I M 1 1 II I I I M I I I II I II I I I I I I || I I I || | | | | | M I II I I I I I I II I I I I I M 
Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

1 1 1 1 1 1 M I M I M II I I I I I I I I M I I | | | | | || | M | M I I M I I I I I I II I I I I I || 
Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 48 0 

Qy 4 81 QI 1TEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

AQ IIIIMMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIlMllliiMIIIIIIIII 

Db 481 QI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

MIIIIIIIIIMIIIIMIIIIIIMIIIIIIIIIIIIIIIMIIIMIIIIIIIIIII 

Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

QY 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

a N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 



I I I I I III I: MM 

FSNYSEIAKFEKSVPEHAELVEDSSPESE 7 



Db 


661 


NAAVQETEAPYI SIACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 


720 


Qy 


721 


PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 

Miiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiii iiiiiiii 


780 


Db 


721 


PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 


780 


Qy 


781 


NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 


840 


Db 


781 


MIMMII MMI MMIMMMMMMMMMIMMIMM MIIMI! 

NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 


840 


Qy 


841 


I IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 

MIMMII MMI IIIIIIMIIMIIMMMIIMIII Mill MIIMI 

IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 


900 


Db 


841 


900 


Qy 


901 


DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSI VKSKSLTKEAEKKLPSDTEKED 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

DEVHVSDEFS ENRS S VS KAS I S PSNVSALE PQTEMGS I VKS KSLTKEAEKKLPSDTEKED 


960 


Db 


901 


960 


Qy 


961 


RSLSAVLSAELSKTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS 

MMIIMI Mill IMIIIMI: IMMIMMMIM! MM! MIIMI 

RSLSAVLSAELSKTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI ALALLS 


1020 


Db 


961 


1020 


Qy 


1021 


VTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 
1 1 1 1 1 1 1 1 1 1 1 1 ! II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1080 


Db 


1021 


1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 If 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 M 1 1 1 1 1 1 
VTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 


1080 


Qy 


1081 


FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 

1 II Mi MINI Mill Mil II Mill IIIIIIII II III MM MIIMI Mill III 

F LVDDLVDS LKFA VLMW VFT YVGALFNGLTLL I LALI SLFS I PVI YERHQVQI DHYLGLA 


1140 


Db 


1081 


1140 


Qy 


1141 


NKSVKDAMAKI QAKI PGLKRKAD 1163 




Db 


1141 


lllMMM Mill MM 

NKSVKDAMAKI QAKI PGLKRKAD 1163 





RESULT 2 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA . 

AC Q9NQC3; 094962; Q9BXG5 ; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7; 

AC Q9Y5U6; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Neuroendocrine-specif ic protein) (NSP) (Neuroendocrine specific 

DE protein C homolog) (RTN-x) (Reticulon 5) (My043 protein) . 

GN RTN4 OR NOGO OR ASY OR KIAA0886. 

OS Homo sapiens (Human) . 

OC Eukaryota,* Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE-20129242; PubMed=l 066778 0 ; 

RA Prinjha R., Moore S.E., Vinson M., Blake S., Morrow R. , Christie G., 

RA Michalovich D. , Simmons D.L., Walsh F.S.; 



RT "Inhibitor of neurite outgrowth in humans."; 

RL Nature 403:383-384 (2000) . 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2) . 

RC TISSUE=Brain; 

RX MEDLINE=21010696; PubMed-1112636 0 ; 

RA Tagami S., Eguchi Y. , Kinoshita M . , Takeda M. , Tsuj imoto Y. ; 

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on 

RT endoplasmic reticulum and reduces their anti-apoptotic activity."; 

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE-20237542; PubMed=1077368 0 ; 

RA Yang J., YuL, Bi A.D., Zhao S.-Y.; 

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome 

RT 2pl4-->2pl3 by radiation hybrid mapping."; 

RL Cytogenet. Cell Genet. 88:101-102(2000). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4) . 

RA Jin W. -L. , Ju G. ; 

RT "Developmentally-regulated alternative splicing in a novel Nogo-A." ; 

RL Submitted (NOV-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE-Placenta, and Skeletal muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in human."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC TISSUE=Fibroblast ; 

RA. Yutsudo M. ; 

RT "Isolation of a cell death- inducing gene."; 

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RC TISSUE=Pituitary; 

RA Song H., Peng Y. , Zhou J., Huang Q. , Dai M., Mao Y.M. , Yu Y. , Xu X. , 

RA Luo B . , Hu R . , Chen J . • 

RT "Human neuroendocrine-specif ic protein C (NSP) homolog gene." ; 

RL Submitted (JUL-1998) to the EMBL/GenBank/DDBJ databases 

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RA Gu J.R-. , Wan D.F.-, Zhao X!"T. , Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y. , Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y 

RA Yu J., Han L . H . ; 

RT "Novel human cDNA clone with function of inhibiting cancer cell 

RT growth . " ; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases 

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain ; 

RX MEDLINE=99156230; PubMed=10048485 ; 

RA Nagase T. , Ishikawa K.-I., Suyama M., Kikuno R. , Hirosawa M. , 

RA Miyajima N., Tanaka A., Kotani H., Nomura N. , Ohara 0. ; 

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 



RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Brain, Pancreas, Placenta, and Skeletal muscle; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G. , 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM. , Schuler G.D. , 

RA Altschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T., Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K. # Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences . " ; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RX MEDLINE=20499367; PubMed=l 1 042 152 ; 

RA Zhang Q.-H., Ye M. , Wu X.-Y., Ren S.-X., Zhao M . , Zhao C.-J., Fu G. , 

RA Shen Y. , Fan H.-Y., Lu G., Zhong M . , Xu X.-R., Han Z.-G., Zhang J.-W., 

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z . ,- 

RT "Cloning and functional analysis of cDNAs with open reading frames for 

RT 300 previously undefined genes expressed in CD34 + hematopoietic 

RT stem/progenitor cells . " ; 

RL Genome Res. 10:1546-1560(2 000). 

RN [12] 

RP .SEQUENCE OF 482-1192 FROM N.A. ( ISOFORM 1/4 ) . 

RC TISSUE-Brain; 

RA Mao Y.M., Xie Y., Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases. 

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1). 

RC TISSUE=Testis; 

RA Sha J.H., Zhou Z.M., Li J.M.; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases. 

RN [14] 

RP TOPOLOGY . 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=10667797 ; 

RA GrandPre T. , Nakamura F., Vartanian T. , Strittmatter S . M . ; 

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain; 



RX MEDLINE=21069055; PubMed=l 12 01742 ; 

RA Fournier A.E., Grandpre T. , Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 

RT regenerat ion . " ; 

RL Nature 409:341-346(2001). 

RN [16] 

RP REVIEW . 

RX MEDLINE=21888956; PubMed=118 91768 ; 

RA Ng C.E.L., Tang B . L . ; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration . " ; 

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults. 

CC Isoform 2 reduces the anti-apoptot ic activity of Bcl-xl and Bcl-2. 

CC This is likely consecutive to their change in subcellular 

CC location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event ^Alternative splicing; Named isof orms=4 ; 

CC Name=l; Synonyms=RTN 4A, Nogo-A, RTN-xL; 

CC IsoId=Q9NQC3-l; Sequence=Displayed; 

CC Name=2; Synonyms-RTN 4B, Nogo-B, RTN-xS, Foocen-M; 

CC IsoId=Q9NQC3-2; Sequence=VSP_005655 ; 

CC Name=3; Synonyms -RTN 4C, Nogo-C, Foocen-S; 

CC IsoId=Q9NQC3-3; Sequence=VSP_005652 , VSP_005653; 

CC Name =4 ; 

CC IsoId=Q9NQC3-4; Sequence=VSP_005654 ; 

CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain 
CC and testis and weakly in heart and skeletal muscle. Isoform 2 is 

CC widely expressed excepted for the liver. Isoform 3 is expressed in 

CC brain, skeletal muscle and adipocytes. Isoform 4 is t.estis- 

CC specific. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC -!- CAUTION: Ref.ll sequence differs from that shown due to 

CC frameshifts in positions 1149 and 1156. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ251383; CAB99248.1; -. 

DR EMBL; AJ251384; CAB99249.1; -. 

DR EMBL; AJ251385; CAB99250.1; -. 

DR EMBL; AB040462; BAB18 927.1; 

DR EMBL; AB040463; BAB18928.1; 

DR EMBL; AF148537; AAG12176.1; -. 

DR EMBL; AF148538; AAG12177.1; 

DR EMBL; AF087901; AAG12205.1; -. 
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EMBL 
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DR 
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• AAD3 1021 
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DR 


EMBL 


• AF132048 


• AAD31022 


• 1; 




DR 


EMBL 


• AB015639 


■ BAA83712 


.1; 




DR 


EMBL 


■ AF077050 


• AAD2 77 83 


l . 




DR 


EMBL 


■ AF177332 


• AAG17976 


1 • 




DR 


EMBL 


• AB020693 


• BAA74909 


.1; 




DR 


EMBL 


• BC001035 


* AAH01035 


• 1; 




DR 


EMBL 


* BC007109 


- AAH07109 


■ 1; 




DR 


EMBL 


■ BC014366 


• AAH14366 


• 1; 




Query Match 


75. 


3%; 


Score 


Best Local Similarity 75. 


9%; 


Pred. 



Matches 909; Conservative 104; 



44 03.5; DB 1; 
No. 3e-159; 
Mismatches 145; 



Length 1192; 
Indels 39; Gaps 
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1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 

1 1 H 1 1 Mill MINI MINIM MMM 1 1 M M I M M I II M II I II II 

1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 



61 



59 



PAAGLSAAAVP- -PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 

IMMMI M Ml MMM M I [ 1 1 1 1 1 1 1 1 1 1 II I ! ! I M I h M : I 

PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 



116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 

Ml MM 1 1 1 1 f 1 1 1 1 1 1 1 J 1 1 ! f f 1 1 II II MIIIIIIMIII 

119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 
167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 

II I MM Ml II Ml Ml II I MM 1 1 1 1 1 1 1 1 M h II 1 1 1 II M 1 1 II II II I 

17 9 RGSSGSVDETLFALPAASEPVIRSSAEN-MDLKEQPGNTISAGQEDFPSVLLETAASLPS 
226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 

Mill Mill MMM I -n - -Mi l Ml -III Mill 

238 LSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEY 
28 6 SEMGSS FKGS PKGESA I LVENTKEEVI VRSKDKED - LVCSAALHS PQES P VG KED 

MM'! Ml MMM I MIMMMMM II = IM II I III 

2 98 SEMGSSFSVS PKAESAVI VANPREE 1 1 VKNKDEEEKLVSNN I LHNQQELPTALTKLVKED 
34 0 RWSPEKTMDIFNEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAA RANV 

III II I III = l MM Mill MMM MMM I I Mill 

3 58 E WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS -DMLAAGGKI ESNL 



20; 
60 
58 
115 
118 
166 
178 
225 
237 
285 
297 
339 
357 
395 
416 
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3 96 ESKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTT 4 54 

MMMM Mill : MM Ml IIIMM Ml I MMM I Mill 

417 ESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIA 4 76 
455 ANTFPLLEDHTSENKTDEKKI EERKAQI I TEK-TSPKTSNPFLVAVQDSEADYVTTDTLS 513 

I MM I IIIIMMIIIIIMIIIMM II 1 1 1 i 1 1 1 1 1 MM MMM M 

477 TNI FPLLGDPTSENKTDEKKI EEKKAQI VTEKNTSTKTSNPFLVAAQDSETDY VTTDNLT 53 6 
514 KVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQL 573 

1 1 1 1 hlllllllllllllllllllll INI! 1 1 llhll III II :|||||| III 

537 KVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQL 596 
574 CPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPEN 633 

III IIMIIIIMilllllllllll :|IMMhlll Mill hhlll 1 1 1 1 



Db 



597 CPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPEN 655 
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b y z 
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PPPYEEAMSVSLKKVSGI KERT RFPFNTNAATiOFTPAPYT npriT ivt^tvi cappnpnp 
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693 
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Nihil h -111111111111 h| || h| Mill 1 

SDYSEMAKVEOPVPDH^FT.VFn^^Pn^FPVriT.FCinnQT xtvtpqt r ri?'-pc 


IIS 


Qy 


753 


-ETVAOHK-EFRL^A^PnFTinKPYTiF^FOPMT.WCITKnA a QMFiT dtt tvvt^vi ct niwurD 

• LJ x V "V 11IV xjXjrYXJOJriO IT ^CjXJVjriYir lUDOr IM-LirlO 1 iVL/i-i - /\Ol\JXJ X Jr 1 J_i 1 iViVlj ivl O XjU1 v 1 Ji Jl Jr 


809 


Db 


776 


1- = I : 1 1 1 1 1 III ■ 1 MM Ihlllll Mill 

FESMI EYENKEKLSALPPFGGKPYTiF^FKT.QT.nMTKnTT T PnFVQTT cwpittdt r>Mi?T7T 
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835 


Qy 


810 


NTAI YSNDDLLS^KEDKT KF^FTF^nQciPT FT TriPPPTWQa vnna wi AirwTnT mrn 


868 


Db 
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=11=111111 III MMMIMMIIIIMIIMI MM II IMMIIIMM 

STAVYSMDDLFT^KFADTRFTFTFQnciQPTFTTnFPPTT t cci^TncuovT aditvt'ot l?t7-c 
^ v J- oiMx^i_/xjL x oxvrurt.^ x ia-JZj 1 Lj X r oJJoor 1 Ji X 1 UUjI 4 JrlljlooJ\± Do r ,S K 1 iA K ri Y 1 DLli Vo 
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Qy 
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DKSEIANT O^nAn^T.PPT.FT.PrTYT .QPIYMT VP IV n^TUVCnrucTTMDCc^jcv^ ot otic-mtt 
xjiwxjx^-u.^ x ^oo^i^oxji^v_.xjJ2jXjJr^XJXj or x\xM X x trtx — JJCiVnVbUijr otWKbb VoKAb 1 orbNV 
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SALEPOTEMGSIVK^K^TiTKFAFKKT,PQnTFK"FnpciT ^znrr cat?t CifTC\AmT t vtaTdtit v 
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MMMIMIIIMMMIIIIIIIIIIIIMIIIIIIIIMIIIIIMIIIIIIIIIII 

KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRA 
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Qy 
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YLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1106 


Db 
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1 i I 1 I , i ! , IMMMMMMMMIMMMIMMIMM 

YLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 


1135 


Qy 
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NGLTLL I LAL ISLFSI PVI YERHQVQ I DH YLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 


Db 


1136 


IMIIIIIIIIMIhllllllll llllllllllhlllllllllllllMIMI: 

NGLTLLI LALI SLFSVPVI YERHQAQI DHYLGliANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 3 
RTN4JYI0USE 

ID RTN4_M0USE STANDARD; PRT; 199 AA. 

AC Q9 9P72; Q9CTE3; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) . 

GN RTN4 OR NOGO. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3T3-L1; TISSUE=Adipocyte ; 

RA Coulson A.C., Craggs P.D., Morris N.J.; 

RT "Mouse vp20/RTN4C CDNA. " ; 



RL Submitted (DEC-2000) to the EMBL / GenBank/ DDB J databases 
RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 
RC STRAIN=C57BL/6J; TISSUE=Embryo ; 
RX MEDLINE=21085660; PubMed=l 12 1785 1 ; 

RA Kawai J . , Shinagawa A., Shibata K. , Yoshino M. , Itoh M. , Ishii Y. , 

RA Arakawa T. , Hara A., Fukunishi Y. , Konno H., Adachi J. , Fukuda S., 

RA Aizawa K. , Izawa M., Nishi K. ,-Kiyosawa H. , Kondo S., Yamanaka I., 

RA Saito T. , Okazaki Y. , Gojobori T. , Bono H. , Kasukawa T. , Saito R. , 

RA Kadota K. , Matsuda H.A. , Ashburner M. , Batalov S., Casavant T., 

RA Fleischmann w. , Gaasterland T. # Gissi C. , King B., Kochiwa H., 

RA Kuehl P., Lewis S., Matsuo-Y., Nikaido I., Pesole G., Quackenbush J . , 

RA Schriml L.M. , Staubli F . , Suzuki R. # Tomita M. , Wagner L. # Washio T. , 

RA Sakai K. , Okido T. , Furuno M. , Aono H. , Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F., 

RA Brownstein M.J. , Bult C. , Fletcher C. , Fujita M. , Gariboldi M.,' 

RA Gustincich S w Hill D . , Hofmann M . , Hume D.A. , Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L. , Mashima J., Mazzarelli J. , Mombaerts P., 

RA Nordone P., Ring B., Ringwald M. , Rodriguez I., Sakamoto N. , 

RA Sasaki H. , Sato K. , Schoenbach C. , Seya T. , Shibata Y. , Storch K.-F., 

RA Suzuki H., Toyo-oka K. , Wang K.H., Weitz C. , Whittaker C, Wilming l! , 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y. , Kawa j i H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event ^Alternative splicing; Named isoforms=l; 

cc Comment =A number of isoforms may be produced; 

CC Name=l; 

CC IsoId=Q99P72-l; Sequence=Displayed ; 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF326337; AAK08076.1; 

DR EMBL; AK0038 59; -; NOT_ANNOTATED_CDS . 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO:0030176; C : endoplasmic reticulum membrane, intrinsic p. . .; ISS. 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0005635; C: nuclear membrane; ISS . 

DR GO; GO: 0005515; F:protein binding activity; ISS. 

DR GO; GO: 0019987; P: negative regulation of anti -apoptosis ; ISS. 

DR GO; GO: 0030517; P:negative regulation of axon extension; ISS. 



DR 


Inter Pro; 


IPR00338 


18; Reticulon. 


DR 


Pfam; PF02453; Reticulon; 


1 . 




DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane 


FT 


DOMAIN 


1 


25 




CYTOPLASMIC (Potential) . 


FT 


TRANSMEM 


26 


50 




POTENTIAL. 


FT 


DOMAIN 


51 


137 




LUMENAL (Potential) . 


FT 


TRANSMEM 


138 


162 




POTENTIAL. 


FT 


DOMAIN 


163 


199 




CYTOPLASMIC (Potential) . 


FT . 


DOMAIN 


12 


199 




RETICULON. 


SQ 


SEQUENCE 


199 AA; 


22466 


MW; 


07BE5D580059ED9C CRC64 ; 



Query Match 15.7%; 
Best Local Similarity 97.9%; 
Matches 187; Conservative 



Score 917; DB 1; Length 199; 
Pred. No. l.le-2 8; 
1; Mismatches 3; Indels 



0; Gaps 



0; 
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973 KTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYI ALALLS VTI SFRI YKGVI 1032 
I 

9 KDKWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYI ALALLS VTI SFRI YKGVI 68 
1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 1092 

69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 128 
1093 AVLMWVFTYVGALFNGLTLLI LALI SLFS I PVIYERHQVQI DHYLGLANKS VKDAMAKI Q 1152 

1 1 1 1 1 1 1 HI li HI MINI MINIM Illllllll 

129 AVLMWVFTYVGALFNGLTLL I LALI SLFS I PVI YERHQAQI DHYLGLANKS VKDAMAKI Q 188 

1153 AKI PGLKRKAD 1163 

IMMIIMI: 
18 9 AKI PGLKRKAE 19 9 



RESULT 4 
RTN1_RAT 

ID RTN1_RAT STANDARD; PRT; 777 AA. 

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) (S-rex) . 

GN RTN1 OR NSP. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi • 

OC Mammalia; Eutheria; Rodent ia ; Sciurognathi ; Muridae; Murinae; Rattus 

OX NCB I JTax I D = 1 0 1 1 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-B AND RTN1-S) . 

RC STRAIN=:Wistar; TISSUE=Brain cortex; 

RX MEDLINE=96386034; PubMed=8 793 864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmental ization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons . " ; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: MAY BE INVOLVED IN NEUROENDOCRINE SECRETION OR IN 
CC MEMBRANE TRAFFICKING IN NEUROENDOCRINE CELLS. 



CC -!- SUBCELLULAR LOCATION: ENDOPLASMIC RETICULUM MEMBRANE (BY 

CC SIMILARITY) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event -Alternative splicing; Named isoforms=2; 

CC Name=RTNl-B; Synonyms=S-RexB; 

CC IsoId=Q64548-l; Sequence-Displayed; 

CC Name =RTN 1 - S ; Synonyms =S -RexS ; 

CC IsoId=Q64548-2; Sequence=VSP_005647 , VSP_005648; 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN . 

CC EXPRESSION OF ISOFORM RTN1-B IS RESTRICTED TO PARTICULAR NEURONAL 

CC TYPES . 

CC -!- DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE 

CC HINDBRAIN AND IN Ell IN THE FOREBRAIN . DURING THE NEXT 3 EMBRYONIC 

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http: //www. isb-sib. ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; U176 04; AAC53 04 6.1; 

DR EMBL; U17603; AAC53045.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

Endoplasmic reticulum; Alternative splicing; Transmembrane. 



KW 



FT 


TRANSMEM 


604 


624 


POTENTIAL. 


FT 


TRANSMEM 


727 


747 


POTENTIAL . 


FT 


DOMAIN 


590 


777 


RETICULON. 


FT 


DOMAIN 


610 


613 


POLY -LEU. 


FT 


VARSPLIC 


1 


569 


Missing (in isoform RTN1-S) . 


FT 








/FTId=VSP_005647 . 


FT 


VARSPLIC 


570 


589 


I PGPLGSDLVPPLPFFNKQK -> ■MQATADSTKMDCVWSNW 


FT 








KSQ (in isoform RTN1-S) . 


FT 








/F'TId=VSP_005648. 


SQ 


SEQUENCE 


777 AA; 


83001 


MW; AF7479C50F28D0AC CRC64 ; 



Query Match 13.7%; Score 801; DB 1; Length 777; 

Best Local Similarity 32.4%; Pred. No. 1.5e-23; 

Matches 258; Conservative 109; Mismatches 264; Indels 166; Gaps 32; 

.QY 485 EKTSPKTSNPFLVAVQDSE ADYVTTDTL SKVTEAAVSNMPEGL- -TPD 530 

1 = =|| : I I || | | |:: | :: |: |: || 

Db 2 9 EEATPKGARP AQQDGEPAWGSGAGAGWSSRGLCSGPARS PPVAMETASTGVAAVPD 85 

QY 531 LVQEACES ELNEATG TKI AYETKVDLVQTSEAI Q - ESLYPTAQLCPSFEEAEA 582 

1=1 = I I = I = : | | : : | | | 
Dt > 86 ALDHSSSPTLKDGEGACYTSLISDICYPPREDSAYFTGILQKENGHITTSESP EELG 142 



Qy 583 TPSPVLPDIVMEAPLNSLLPSAGASWQPS VSPLEAPPPVSYDSIKLE 630 

II I II- 1 = 111 - h h M I I :| I 

Db 143 TPGPSLPEVPGTEP - HGLLSSDSGI EMTPAESTEVNKI LADP LDQMKAEACKYI DI T 198 

Qy 631 PENPPPYEEAMNVALK ALGTK- EGI KEPE SFNAAVQETE 668 

I I I == I • II Ih I I ^ I 

Db 199 RPQEAKGQEEQSPGLEDKDLDFKDKDSEVSTKPEGVHAPNQPSPVEGKLIKDNLFEESTF 258 

Qy 669 APYI S I ACDLI KETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESEPVD 723 

MM I = = ==| : I Ml -Ml M IM I 

Db 259 APYIDELSD- -EQHRMSLVTAPVKITLTEIGPPVMTATHETI PEKQDLCLKPSPDTVPTV 316 

Qy 724 LFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK EERLSASPQELGKPYLESF 778 

h h = MM Ml MM •• I I 

Db 317 TVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIKEAKGLSYETTE 37 0 

Qy 779 QPN LHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETF 833 

I I M Mh M Ml M 

Db 371 SPRPVGQAADRPKVKARSGLPTI PS SLDQEASSAESGDSEI ELV 414 

Qy 834 SDSSPIEIIDEFPT F VS AKDDS PK1AK- EYTDLE VSDKS E I AN I QSGADSLP CL. 886 

1= h I h II II MM : M : II : 

Db 415 SE-DPMASEDALPSGYVSFGHVSGPPPSPASPSIQYSILREEREAEL DSELII 4 66 

Qy 887 ELPCDL-SFKNI YPKDEVH VSDEFSENRSSVSKASISPSNVSALEPQTEM 935 

I II I III II I IM Mh Ml = 

Db 467 E-SCDASSASEESPKREQDSPPMKPGVLDAIREETSSRATEERAPSHQGPVEPDPILSFT 52 5 
Qy 936 GSIV KSKSLTKEAE KKLPSDTEKEDRSLSAVLSAEL- - 971 

IM 11= M MM I : MM 

Db 526 PVTLQSRPEPSSGDGAPVPEPPKSQQQKPEEEAVSSSQSPAATE IPGPLGSDLVP 58 0 

Qy 972 S KTS WDLLYWRD I KKTG WFGASLFLLLSLTVF S I VS VTAY I ALALLS VT I SFR 1026 

M -MMMM'M ! I 1 1 Ml MM 1 1 MM II 1 1 Mill 

Db 581 PLPFFNKQKAIDLLYWRDIKQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATISFR 64 0 

Qy 1027 I YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDL 1086 

Ml IMIMIMIIIIIMIII h MM MM- M I II M II I M I II II 

Db 641 IYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDL 700 

Qy 1087 VDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKD 1146 

M II M 1 1 1 II _M M I M II M II M M M M I M Ml IM MM 

Db 701 VDSLKFAVLMWLLTYVGALFNGLTLLLI^WSMFTLPVVYVKHQAQVDQYLGLVRTHINT 76 0 

Qy 114 7 AMAKIQAKI PGLKRKAD 1163 

MIMIMM II 
Db 761 WAKI QAKI PGAKRHAE 777 



RESULT 5 
RTN INHUMAN 

ID RTN1_HUMAN STANDARD; PRT; 776 AA. 

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 {Rel. 42, Last annotation update) 



DE Reticulon 1 (Neuroendocrine-specif ic protein) . 

GN RTN1 OR NSP. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A; RTN1-B AND RTN1-C) . 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=93293865; PubMed=7685762 ; 

RA Roebroek A. J.M. , Van de Velde H.J.K., Van Bokhoven A-., Broers J.L.V., 

RA Ramaekers F.C.S., Van de Ven W.J.M.; 

RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specif ic gene and identification of its 135-kDa 

RT translational product."; 

RL J . Biol. Chem. 268:1343 9-13447(1993). 

RN [2] 

RP ALTERNATIVE SPLICING . 

RX MEDLINE=96429995; PubMed-8833 14 5 ; 

RA Roebroek A. J.M. , Ayoubi T.A.Y., Van de Velde H.J.K., 

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.; 

RT "Genomic organization of the human NSP gene, prototype of a novel gene 

RT family encoding ret iculons . " ; 

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=95 6 04 66 ; 

RA Hens J . , Nuydens R. , Geerts H., Senden N.H., Van de Ven W.J.M., 

RA Roebroek A.J., van de Velde H.J., Ramaekers F.C., Broers J.L.; 

RT "Neuronal differentiation is accompanied by NSP-C expression."; 

RL Cell Tissue Res. 292:229-237(1998). 

CC -!- FUNCTION: MAY BE INVOLVED IN NEUROENDOCRINE SECRETION OR IN 

CC MEMBRANE TRAFFICKING IN NEUROENDOCRINE CELLS. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane. 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event ^Alternative splicing; Named isoforms=3; 

CC Name=RTNl -A ; Synonym s= NSP -A; 

CC IsoId=Q16799-l; Sequence=Displayed; 

CC Name=RTNl-B ; Synonyms =NSP-B; 

CC IsoId=Q16799-2; Sequence=VSP_00564 4 ; 

CC Name=RTNl - C ; Synonyms =NSP-C; 

CC IsoId=Q16799-3; Sequence=VSP_005645 , VSP_005646; 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES 
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTN1-C 

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION . 

CC -!- PTM: ISOFORMS RTN1-A AND RTN1-B ARE PHOSPHORYLATED . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L10333; AAA59950.1; -. 



DR 


EMBL ; L10334; AAA59951.1; 


DR 


EMBL; L10335; AAA59952.1; 


DR 


PIR; A46583; A4658 


3 . 


DR 


PIR; 160904; 160904. 


DR 


Genew; HGNC : 104 67 ; 


RTN1 . 


DR 


MIM; 600865; -. 




DR 


GO; GO:0030176; C: 


endoplasmic reticulum membrane, intrinsic p. . .; TAS 


DR 


GO; GO: 0004871; F: 


signal transducer activity; NAS . 


DR 


GO; GO:0030182; P: 


neuron differentiation; TAS. 


DR 


GO; GO:0007165; P: 


signal transduction; NAS. 


DR 


InterPro; IPR00338 


18; Reticulon. 


DR 


Pfam; PF02453; Reticulon; 1. 


DR 


PROSITE; PS50845; 


RETICULON; 1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane; 


KW 


Phosphorylation . 




FT 


TRANSMEM 603 


623 POTENTIAL. 


FT 


TRANSMEM 72 6 


746 POTENTIAL. 


FT 


DOMAIN 589 


776 RETICULON. 


FT 


DOMAIN 609 


612 POLY-LEU. 


FT 


VARSPLIC 1 


420 Missing (in isoform RTN1-B) . 


FT 




/FTId=VSP_005644 . 


FT 


VARSPLIC 1 


568 Missing (in isoform RTN1-C) . 


FT 




/FTId=VSP_005645 . 


FT 


VARSPLIC 569 


588 GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW 


FT 




KSQ (in isoform RTN1-C) . 


FT 




/FTId=VSP_005646 . 


SQ 


SEQUENCE 7 76 AA; 


: 83617 MW; CA5B62323 53 0 96FE CRC64 ; 



Query Match 13.5%; Score 78 9.5; DB 1; Length 776; 

Best Local Similarity 31.2%; Pred. No. 4.1e-23; 

Matches 243; Conservative 115; Mismatches 252; Indels 169; Gaps 28; 



Qy 


487 


TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGT 


546 


Db 


65 


SGPARQS P - - VAMETASTGVAGVSSAMDHTFSTTS KDGEG SCYTSLI S 


110 


Qy 


547 


KIAYETKVDLVQTSEAIQ-ESLYPTAQLCPSFEEAEATPSPVLPDI - - VMEAPLNS 

I I : I : :| |: : | 1 1 1 1 1 Mh : 1 1 


599 


Db 


111 


DICYPPQEDSTYFTGILQKENGHVTISESP EELGTPGPSLPDVPGI ESRGLFSSDSG 


167 


Qy 


600 


--LLPSAGASWQPSVSPLEAPPPVSY DSI KLEPENPPPYEEA M 


641 


Db 


168 


: |: I : lh : 1 : :| : :: | |: 

IEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEEVKHQEQHHPELEDKDLDFKNKDT 


227 


Qy 


642 


NVALKALGTKEG IKEPE SFNAAVQETEAP Y I S IACDL I KETKLSTE - PS P 


690 


Db 


228 


::-| MMh : : 1 MM M M = " = = M 
DISIK PEGVREPDKPAPVEGKI I KDHLLEESTFAPYID DLSEEQRRAPQITTP 


280 


Qy 


691 


DFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLF-SDDSI PEV PQTQEEAVMLMK 


745 


Db 


281 


:|| 1 :: :|| : : | | |::| | |: 

VKI TLTE I E PS VETTTQEKTPEKQD I CLKPS PDTVPTVTVS EPEDDS PGS I TPP 


334 


Qy 


746 


ESLTEVSETVAQHK EERLSASPQELGKPYLESFQP NLHSTKDAASND 

MM Mi M -1 = 1 1 : M 1 


792 


Db 


335 


SSGTEPSAAESQGKGS I SEDELI TAI KEAKGLS YETAENPR P VGQLADRPE VKARSG PPT 


394 


Qy 


793 


IPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI 


827 



Db 



3 95 I PSPLDHEASSAESGDSEI ELVSEDPM- -AAEDALPSGYVSFGHVGGPPPSPASPSIQYS 452 



Qy 828 KESETFSDSS P I E 1 1 DEFPTFVSAKDDS PKLAKEYTDLEVSDKSEI ANIQSG 879 

H I II hi I II -III - - | | 

Db 4 53 I LREEREAELDS ELI I ESCDASSASEES PKREQDS PPMKPSALDAI REETGVRAEER 509 

Qy 880 ADSLPCLELPCDLS FKN I YPKDEVHVSDEFS ENRS S VS KAS I S PSNVSALEPQTEMGS I V 939 

I I I I II = II I I. I Ml- | 

Db 510 APSRRGLAEPG- -SFLD- YPSTEPQPGPEL PPGDGALEPETPM 54 9 

Qy 94 0 KSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE LS KTS WDLLYWRD 984 

II I- | : : : hi : I I I I I I I 

Db 550 LPRKPEEDSSSNQSPAATKGPGPLGPGAPPPLLFLNKQKAIDLLYWRD 597 

Qy 985 I KKTGWFGASLFLLLSLTVFSI VSVTAYIAIjALLSVTI SFRI YKGVI QAIQKSDEGHPF 1044 

Ihlhllh I II III hi: hll II I II I II II hlhlhllllll 

Db 598 I KQTGI VFGSFLLLLFSLTQFSWSWAYLALAALSATI SFRI YKSVLQAVQKTDEGHPF 657 

Qy 1045 RAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

HIM h :hl MM- = I I I h I I I I I I I I I I I I I I I I I I I I I h Mill 

Db 658 KAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGA 717 

Qy 1105 LFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

I i I I I I I I - |: ^ h h : I h I Ml Ml | | | | : MMIIIMI || 

Db 718 LFNGLTLLLMAWSMFTLPVWVKHQAQI DQYLGLVRTHI NAWAKI QAKI PGAKRHAE 776 



RESULT 6 
RTN3_HUMAN 

ID RTN3_HUMAN STANDARD; PRT; 236 AA. 

AC 095197; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 3 (Neuroendocrine-specif ic protein-like 2) (NSP-like 

DE protein II) (NSPLII) . 

GN RTN3 OR NSPL2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Retina; 

RX MEDLINE=99265974; PubMed=10331947 ; 

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.; 

RT "Cloning of a novel member of the reticulon gene family (RTN3) : gene 

RT structure and chromosomal localization to llql3 . " ; 

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X . , Zhou Y. , Du G . , Yuan J . , Qiang B . ; 

RT "Cloning and expression analysis of a cDNA encoding a novel 

RT neuroendocrine-specif ic protein-like protein 1: NSPL1 . " ; 

RL Submitted (JAN-1999) to the EMBL/ GenBank / DDB J databases. 

RN [3] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Brain / Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H. , Derge J.G., 

RA Klausner. R.D. , Collins F.S., Wagner L . , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B . , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A.A. , Rubin G.M., Hong L. , 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Us'din T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A., Peters G.J. , Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X. , Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; AF059524; AAC99319.1; -. 

DR EMBL; AF059529; AAD20951.1; -. 

DR EMBL; AF059525; AAD20951.1; JOINED. 

DR EMBL; AF059526; AAD20951.1; JOINED. 

DR EMBL; AF059527; AAD20951.1; JOINED. 

DR EMBL; AF059528; AAD20951 .1; JOINED. 

DR EMBL; AF119297; AAD26810.1; -. 

DR EMBL; BC000634; AAH00634.1; -. 

DR EMBL; BC010556; AAH10556.1; 

DR EMBL; BC011394; AAH11394.1; -. 

DR EMBL; BC022993; AAH22993.1; 

DR Genew ; HGNC : 1 0 4 6 9 ; RTN3 . 

DR MIM; 604249; -. 

DR GO; GO: 0005615; C : extracellular space; TAS . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM .68 88 POTENTIAL. 



FT TRANSMEM 177 197 POTENTIAL. 
FT DOMAIN 48 236 RETICULON . 

SQ SEQUENCE 236 AA; 25609 MW; DDC6A4 544ABCDFB7 CRC64; 

Query Match 10.7%; Score 625.5; DB 1; Length 236; 

Best Local Similarity 59.1%; Pred. No. 1.4e-17; 

Matches 114; Conservative 41; Mismatches 37; Indels 1; Gaps 1; 

Qy 972 S KTS WDLLYWRD I KKTG WFGASLFLLLSLTVFS I VS VTAY I ALALLS VTI SFR I YKGV 1031 

I : I IhHIhllll III H HIM I I — I I :|: I I I I I I I I I M I I I I 
Db 44 SSCAVHDLI FWRDVKKTGFVFGTTLIMLLSLAAFSVI S WSYLILALLSVTI SFRI YKSV 103 

Qy 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 1091 

Mhllhllllhllh == =| I I h|: hi : I : 

Db 104 I QAVQKSEEGHPFKAYLDVDI TLSSEAFHNYMNAAMVHI NRALKLI I RLFLVEDLVDSLK 163 

Qy 1092 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVI YERHQVQIDHYLGLANKSVKDAMAKI 1151 

II Ih llllhllhllllll : :||:|::||::: ||||h|:| | : || 

Db 164 LAVFMWLMTYVGAVFNGITLLILAELLIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKI 223 

Qy 1152 QAKI PGL-KRKAD 1163 

Db 224 QAKLPGIAKKKAE 236 

RESULT 7 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Qiang H. , Yuan J . , Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homo log. " ; 

RL Submitted (OCT-1999) to the EMBL/GenBank/ DDB J databases. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF195940; AAG31360.1; 



DR MGD; MGI: 1339970; Rtn3 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 8 9 POTENTIAL. 

FT TRANSMEM 167 187 POTENTIAL. 

FT DOMAIN 4 9 237 RETICULON . 

SQ SEQUENCE 237 AA; 25428 MW; EB60A0A7AC45F0DE CRC64 ; 



Query Match 10.7%; Score 625.5; DB 1; 

Best Local Similarity 59.1%; Pred. No. 1.4e-17; 
Matches 114; Conservative 41; Mismatches 37; 



Length 23 7; 
Indels 1; 



Gaps 



1; 



Qy 

Db 



972 SKTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAYIALALLSVTI SFRI YKGV 1031 

I :| IhMlhllll III H Ml lh:|| :|: II II M MM M I 
45 SSCAVHDLI FWRDVKKTGFVFGTTLI MLLSLAAFS VI S WSYLI LALLSVTI SFRVYKSV 104 



QY 
Db 

Qy 

Db 

Qy 
Db 



1032 I QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLK 1091 

llhllhllllhllh - =| I I hh III :| : llllhlllllll 

105 I QAVQKS E EGHP F KA YLD VD I TLS S EAF HN YMNAAM VH VN KALKL 1 1 RLFLVEDLVDS LK 164 

1092 FAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI 1151 

II M llllhllhllllll : :||:|::||::: |||MM | : 

165 LAVFMWLMTYVGAVFNGI TLLI LAELLVFSVP I VYEKYKTQI DHYVGI ARDQTKS I VEKI 224 

1152 QAKI PGL - KRKAD 1163 

225 QAKLPGIAKKKAE 237 



RESULT 8 
RTN2_MOUSE 

ID RTN2JYIOUSE STANDARD; PRT; 4 71 AA . 

AC 070622; 070620; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine- specific protein-like 1) (NSP-like 

DE protein 1) (NSPLI) . 

GN RTN2 OR NSPLI. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=1009 0; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv ; TISSUE=Cerebellum / and Skeletal muscle; 

RX MEDLINE=98191726; PubMed=953 0622 ; 

RA Geisler J.G., Stubbs L.J. # Wasserman W.W. , Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Membrane - bound . Endoplasmic reticulum 

CC (Potential) . 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event ^Alternative splicing; Named isoforms=2; 



CC Name=l; Synonyms -Bra in ; 

CC Isold=070622-1; Sequence=Displayed; 

CC Name=2; Synonyms =Muscle; 

CC IsoId=070622-2; Sequence=VSP_005650 , VSP_005651; 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN NEURAL AND MUSCULAR 
CC TISSUES. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF038537; AAC14906.1; 

DR EMBL; AF038537; AAC14907.1; -. 

DR EMBL; AF038538; AAC14908.1; -. 

DR EMBL; AF038539; AAC14909.1; 

DR EMBL; AF093624; AAD13195.1; -. 

DR MGD; MGI: 107612; Rtn2 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 2 95 315 POTENTIAL. 

FT DOMAIN 272 471 RETICULON. 

FT VARSPLIC 1 267 Missing (in isoform 2). 

FT /FTId=VSP_005650. 

FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2) . 

FT /FTId=VSP_005651 . 

SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 

Query Match 8.7%; Score 509; DB 1; Length 471; 

Best Local Similarity 28.6%; Pred. No. 8.3e-13; 

Matches 146; Conservative 84; Mismatches 175; Indels 106; Gaps 13; 
Qy 690 PDFSNYSEIAKFEKSVPEHAELVEDSSP- -ESEPVDLFSDDSI PEVPQTQEEAVMLMKES 747 
Db 6 PVFAHCKEAPSTASSTPDSTEGGNDDSDFRELHTAREFSED 4 6 

QY 748 LTEVS ETVAQH KEERLSAS PQELGKPYLESFQPNLHST 785 

I M : I : : : h h I I I 

Db 47 --EEEETTSQDWGTPRELTFSYIAFDGWGSGGRRDSWRRPRPQGRSVSEPRDPPQQSG 104 

Qy 786 KDAASNDI PTLTKK EKI SLQMEEFNTAI YSNDDLLSSKEDK 826 

< !hh: h: |:::: : | | || 

Db 105 LGDSLES I PSLSQS PEPGRRGDPDPVPPAERPLEELRLRLDQLGWWRS AGSGED- 159 

QY 827 IKESETFSDSSPI EI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCL 886 

I hhl =| I : I : II I : I I : :| | 
Db 160 SATSSSTPLE- -NEEPDGLEASE AGEETNLEL RLAQ SL 195 

Qy 887 ELPCDLSFKNI YPKDEVHVSDEFSENRSS VSKAS I S PSNVSALEPQTEMGS I VKSKSLTK 946 

I == - I I II I : I : I | ::: : :| 

Db 196 HLQLEVLTPQLSPSSGTPQAHTPSPQRSQDSNS- -GPDDEPLLNWEEHWRLLEQEPITA 253 



Qy 947 EAE KKL PS DTE KEDRS LS AVLS AELS KTS WDLLYWRD I KKTG VVFGASLFLLLSLTVFS 1006 

I- I : I I llllhl -Ml : || I || 

Db 254 QC LDSTDQSEFMLEPLL LVADLLYWKDTRTSGAVFTGLMASLLCLLHFS 3 02 

Qy 1007 I VS VTAYI ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSA 1066 



Db 3 03 I VS VAAHLALLGLCATI SLRVYRKVLQAVHRGDGTNPFQAYLDMDLTLTREQTERLSQQI 362 

Qy 1067 LGHVNST I KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL I LAL I SLFS I PVI Y 1126 

II II HI llhlllllll hi hllhlllllhll = = < I I : = I = = I 

Db 3 63 ASHVVSTATQLRHFFLVEDLVT3SLKLALLFYI LTFVGAI FNGLTLVILGWALFTVPLLY 422 

Qy 112 7 ERHQVQI DHYLGLANKS VKDAMAKI QAKI PG 1157 

HI III hll : Ilhlllll 

Db 423 RQHQAQI DQYVGLVTNQLSHI KAKI RAKI PG 453 

RESULT 9 
RTN2JHUMAN 

ID RTN2_HUMAN STANDARD; PRT; 54 5 AA. 

AC 075298; 060509; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 

DE protein 1) (NSPLI) . 

GN RTN2 OR NSPLI. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. , ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY. 

RC TISSUE-Lung carcinoma; 

RX MEDLINE=98360096; PubMed=9693 037 ; 

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.; 

RT " cDNA cloning, genomic organization, and expression of the human RTN2 

RT gene, a member of a gene family encoding ret iculons . " ; 

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A. (ISOFORM RTN2-B) . 

RC TISSUE=Brain ; 

RX MEDLINE=98191726; PubMed=953 0622 ; 

RA Geisler J.G. , Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms^2; 

CC Name=RTN2 -A ; 

CC Isold=075298-1; Sequence=Displayed ; 

CC Note=Isoform RTN2-C is produced by alternative initiation at 

CC Met-341 of isoform RTN2-A; 

CC Name=RTN2-B; 



CC IsoId-075298-2; Sequence=VSP_00564 9 ; 

CC Event =Alternative initiation; 

CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced 

CC by alternative initiation at Met-1 and Met-341; 

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; -. 

DR EMBL; AF038540; AAC14910.1; -. 

DR Genew; HGNC: 10468; RTN2 . 

DR MIM; 603183; -. 

DR GO; GO:0030176; C : endoplasmic reticulum membrane, intrinsic p. . . ; NAS . 

DR GO; GO:0004871; F:signal transducer activity; NAS . 

DR GO; GO: 0007165; P: signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Alternative initiation. 

FT CHAIN 1 545 RETICULON PROTEIN 2, ISOFORM RTN2-A. 

FT CHAIN 341 545 RETICULON PROTEIN 2, ISOFORM RTN2-C. 

FT INITJ4ET 341 341 FOR ISOFORM RTN2-C. 

FT TRANSMEM 368 388 POTENTIAL. 

FT TRANSMEM 4 63 4 83 POTENTIAL. 

FT DOMAIN 345 545 RETICULON. 

FT VARSPLIC 272 344 Missing (in isoform RTN2-B) . 

FT /FTId=VSP_005649. 

SQ SEQUENCE 545 AA; 59263 MW; 971FD2F909E1E9E6 CRC64 ; 

Query Match 8.4%; Score 4 90; DB 1; Length 54 5; 

Best Local Similarity 28.3%; Pred. No. 5.2e-12; 

Matches 160; Conservative 76; Mismatches 202; Indels 128; Gaps 17; 

Qy 668 EAPYI S IACDLI KETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESEPVDL 724 

III I III I I- h :| : | : =| 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEEDEEETTSQDWGTPRELTFSY 66 

Qy 725 FSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL-ESFQPNLH 783 

= I = = = . = III M : | | : : M :| :| 

Db 67 IAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESI - - PSLSQSPEPGRR 124 

Qy 784 STKDAASND I PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KES ETFSDS S P I E 1 1 D 843 

II h I = I-: III : I hhl 
Db 125 GDPDTAP PSERPLEDLRLRLDHLGWVARGT GSGED SSTSSSTPLE 169 



Qy 



844 EFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPKDEV 903 



II I 

EL - - D 



Db 


170 


DEEP- - -QEPNRLETGEAGE EL- -DLRLRLAQPSSPE 


2 01 


Qy 


904 


HVSDEFSENRSSVSKASISP SNVSALE PQTE 
\ = = 1 1 II III 


934 


Db 


202 


VLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPLEREPVRGQCLDSTDQ 


261 


Qy 


935 


MGSIVK SKSLTKEAEKKLP - S 


954 


Db 


262 


:|: :: || | :| 
LEFTVE PRLLGTAMEWLKTS LLLAVYKT VP ILELSPP LWTA I GWVQRG PT P PT PVLRVLL 


321 


Qy 


955 


DTEKEDRSLSAVLSAELSK- - -TSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVT 


1011 


Db 


322 


1 Mill 1 : 1 llllhl : HIM = III III III 
KWAKS PRS - SGVPSLSLGADMGS KVADLLYWKDTRTSGWFTGLMVSLLCLLHFS I VS VA 


380 


Qy 


1012 


AYIALALLSWISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVN 
1 - . 1 1 1 1 1 1 1 i . 1 . 1 . 1 1 . . 1 .11.111 1 . . I . i 


1071 


Db 


381 


i ' * I ■ 1 ■ 1 • II • ■ 1 * 1 1 • 1 I 1 • ■ • • * * 1 • • 1 ■ 1 

AHLALLLLCGTISLRVYRKVLQAVHRGDGANPFQAYLDVDLTLTREQTERLSHQITSRW 44 0 


Qy 


1072 


STI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL I SLFS I PVI YERHQV 

1 : 1 1 llhlllllll hi hllhlllllllll :| Ihll-I M 
SAATQLRHFFLVEDLVDSLKLALLFYI LTFVGAI FNGLTLLI LGVI GLFT I PLLYRQHQA 


1131 


Db 


441 


500 


Qy 


1132 


QIDHYLGLANKSVKDAMAKIQAKIPG 1157 
III hll : Ilhlllll 




Db 


501 


QI DQYVGLVTNQLSHI KAKI RAKI PG 52 6 





RESULT 10 
PCLO_HUMAN 

ID PCLO_HUMAN STANDARD; PRT; 514 7 AA. 

AC Q9Y6V0; 043373; 060305; Q9BVC8 ; Q9UIV2; Q9Y6U9; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Piccolo protein (Aczonin) (Fragments) . 

GN PCLO OR ACZ OR KIAA0559. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE OF 1-759 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=105088 62 ; 

RA Wang X., Kibschull M . , Laue M.M., Lichte B . , Petrasch-Parwez E . , 

RA Kilimann M.W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT prof il in." ; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP SEQUENCE OF 552-4404 FROM N.A. 

RA Kraemer J., Wollam C, Wohldmann P., McGrane B.; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 3619-5147 FROM N.A. (ISOFORM 2). 



RC TISSUE=Brain ; 

RX MEDLINE=98290545; PubMed=9628581 ; 

RA Nagase T. , Ishikawa K.-I., Miyaj ima N. , Tanaka A., Kotani H. , 

RA Nomura N.. , Ohara 0. ; 

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res . 5:31-39(1998), 

RN [4] 

RP SEQUENCE OF 4405-4439 FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L. , Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L.,Shenmen CM., Schuler G . D . , 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T. , Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A., Rubin G.M., Hong L. , 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S. # Carninci P . , Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G . J . , Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A. , Gunaratne P.H., 

RA Richards S., Worley K.C, Hale S., Garcia A.M., Gay L.J., Hulyk S.W. , 

RA Villalon D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C, Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C, 

RA Rodriguez A.C, Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 4405-5147 FROM N.A. 

RA Kalicki J., Elliott G. ; 

RL Submitted (FEB- 1998) to the EMBL/ GenBank / DDB J databases. 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 
CC organization of synaptic active zones and in synaptic vesicle 

CC trafficking (By similarity) . 

CC -!- SUBUNIT: Interacts with Rabacl/Pral and profilin (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 
CC synaptic junctions (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS : 

CC Event ^Alternative splicing; Named isof orms=2 ; 

CC Comment ^Additional isoforms seem to exist; 

CC Name=l; 

CC IsoId=Q9Y6V0-l; Sequence=Displayed ; 

CC Name-2 ; 

CC IsoId=Q9Y6V0-2; Sequence=VSP_003 923 , VSP_003924, VSP_003925, 

CC VSP_003926, VSP_003927; 

CC Note=No experimental confirmation available; 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 



cc 


This SWISS-PROT 


entry is copyright. It is produced through a collaboration 


cc 


between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 


cc 


the European Bioinf ormatics Institute. There are no restrictions 


on its 


cc 


use by non-profit institutions as long as its content is in 


no way 


cc 


modified and this statement is not removed. Usage by and for commercial 


cc 


entities requires a license agreement (See http://www.isb-sib.ch/announce/ 


cc 
cc 

DR 


or send an email 


to license@isb-sib. ch) . 




EMBL; Y19188; CAB60727.1; - 






DR 


EMBL; AC004 903; 


AAD20936.1 






DR 


EMBL; AC004886; 


AAD21789. 1 






DR 


EMBL; AB011131; 


BAA25485 . 1 






DR 


EMBL; BC001304; 


AAH01304 . 1 






DR 


EMBL; AC004082; 


AAB97937 . 1 






DR 


PIR; T00634; T00634 . 






DR 


HSSP; P04410; 1A25. 






DR 


Genew; HGNC: 134 06; PCLO . 






DR 


MIM; 604918; -. 








DR 


GO; GO: 0005856; 


C: cytoskeleton; NAS . 




DR 


GO; GO: 0045202; 


C: synaptic 


junction; ISS. 




DR 


GO; GO: 0005509; 


F: calcium ion binding activity; ISS. 




DR 


GO; GO: 0005544; 


F: calcium-dependent phospholipid binding acti . . . 


• ISS. 


DR 


GO; GO: 0005522; 


F :prof ilin 


binding activity; ISS. 




DR 


GO; GO: 0007010; 


P : cytoskeleton organization and biogenesis; ISS. 




DR 


GO; GO: 0016080; 


P : synaptic 


vesicle targeting; ISS. 




DR 


InterPro; IPR000008; C2 . 






DR 


InterPro; IPR001565 ; Synaptotagmin . 




DR 


PRINTS; PR00360; 


C2 DOMAIN. 






DR 


PRINTS; PR003 99; 


S YNA P TOTAGMN . 




DR 


SMART; SM00239; 


C2; 2. 






DR 


PROSITE; PS004 99; C2_DOMAIN_l; 1. 




DR 


PROSITE; PS50004; C2_DOMAIN_2; 2. 




KW 


Calcium/phospholipid-binding; Zinc; Metal -binding; Zinc-finger; 




KW 


Repeat; Alternative splicing. 




FT 


NONJTER 1 


1 






FT 


DOMAIN 400 


465 


10 X 10 AA TANDEM APPROXIMATE REPEATS 


OF 


FT 






P-A-K-P-Q-P-Q-Q-P-X. 




FT 


ZN_FING 4 99 


523 


C4 -TYPE (POTENTIAL) . 




FT 


ZN FING 969 


992 


C4-TYPE (POTENTIAL) . 




FT 


NON_CONS 1010 


1011 






FT 


DOMAIN 2300 


2325 


POLY- PRO. 




FT 


DOMAIN 4391 


4442 


PDZ. 




FT 


DOMAIN 4544 


4633 


C2 DOMAIN 1. 




FT 


DOMAIN 5031 


5121 


C2 DOMAIN 2 . 




FT 


VARSPLIC 44 04 


4404 


S -> SGNGLG I RI VGGKE I PGHSGE I GAY I AKI LPGGSAE 


FT 






QTGKLMEG (in isoform 2) . 




FT 






/FTId=VSP_003923 . 




FT 


VARSPLIC 4534 


4534 


K -> KPTDGTKWSHPITGEIQ (in isoform 2) 




FT 






/FTId=VSP_003924 . 




b 1 


VARSPLIC 4 576 


4576 


G -> GQVMWQNAS (in isoform 2) . 




FT 






/FTId=VSP_003925 . 




FT 


VARSPLIC 4757 


4761 


TAHKS -> SKRRK (in isoform 2) . 




FT 






/FTId=VSP_003926. 




FT 


VARSPLIC 4762 


5147 


Missing (in isoform 2) . 




FT 






/FTId=VSP 003927. 




so 


SEQUENCE 5147 


AA; 563537 MW; CD5D84 990498CD3C CRC64 ; 





Query Match 5.3%; Score 308.5; DB 1; Length 5147; 

Best Local Similarity 21.4%; Pred. No. 0.00065; 

Matches 264; Conservative 150; Mismatches 438; Indels 383; Gaps 63; 

Qy 3 DIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERKPA 62 

II • I I II HI = =1 : - h II 
Db 251 DIVRGESVKPSLPSPSKPP IQQPTPGKPPAQQPGHEKSQPG- PAKPP 296 

Qy 63 AGLSAAAVP PAAAAPLLDFSSDSVPPA- PRGPLPAAPPA A 101 

I I I I I = : III I I II III | 

Db 297 AQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLG- -PAKPPAQQTGSEKPSSEQPGPKA 354 

Qy 102 PERQPSWERSPA-APAPSLP PAAAVLPSKLPEDDEPPA 138 

Db 355 LAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPAQTKPPSQQP 414 

Qy 13 9 RPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVI PSSAEKIM 195 

HI! I I I I I II - II I | = | : || h 

Db 415 GSTKPPPQQPGPAKP- -SPQQPGSTKPPSQQPGS AKPSAQQ PSPAKPSA 461 

Qy 196 DLMEQPGNTVSSGQEDFPSVLLETAASLPS LS PL - S TVS F KEH - G YLGNLSAVS 247 

: I : h I : :| II : || :| | | = = 

Db 4 62 QQFTKPVSQTGFGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTTELLLHVPEKANFNTCT 521 

Qy 248 SSEGTI EETLNEASKEL PERATN P FVNRDL 277 

= h I II = I h I I 

Db 522 ECQTTVCSLCGFNPNPHLTEAKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVTTTSA 581 

Qy 278 AEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGK 337 

I ■■ ■ : I I ■ ■■ II =h : I h : I 

Db 582 VSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSK 641 

Qy 338 EDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVES 3 97 

: I I : U III II II U II I = I == I : 

Db 642 P APPKEPSVPSEQDK APV ADDKP-KQPKMVKPTTD LVSSSSATTKP 686 

Qy 3 98 KVDRKCLEDSLEQKS LGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTT 454 

Db 687 DI PSSKVQSQAEEKTTPPLKTDSAKPSQ- -SFPPTGEKV -TPFDSKAIPRP 734 

Qy 455 ANTFPLLE- -DHTSENKTD EKKIEERKAQI ITEKTSPK 490 

h : I I : I :|| I :||| | ||| 

Db 735 ASDSKI ISHPGPSSESKGQKQVDPVQKKEEPKKAQ TKMSPKPDAKPMPKGSPTPPGP 791 

Qy 4 91 - -TSNPFLVAVQDS EADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEAT 544 

Db 792 RPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQ PTTPQET VT 838 

Qy 545 GTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI VMEAPLNSLLPSA 604 

I :: M|: I I : | : : | | | | : : 

Db 839 GKLFGFGAS I - FSQASNLI STAGQPG PHSQSGPGAP MKQAPAPSQPPTS 886 

Qy 605 GASWQPSVSPLEAPP PVSYD SIKLEP ENPPPYEE 639 

I I : I I I || : : MIL : Ml :: 

Db 887 QG PPKSTGQAPPAPAKSI PVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKD 942 



Qy 



64 0 AMNVALK ALGTK- 



EGIKEPESFNAAVQ 



E 666 



: == = I II I hi :|| : 

Db 943 SKSLTAEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGF 1002 

Qy 667 TEAPYISIAC DLIK ETKLSTEPSPDFSNYSEIAKFEKSVPEHA 709 

Db 1003 NPTPHLTENCQTQRAISGQLGDIRKMPPAPSGPKASPMPVP TESSSQKTAVPPQV 1057 

Qy 710 ELV EDSSPESEPV DLFSDDS I PEV PQTQEEAVMLMKESLTEVSETVA 756 

= 11 h I I : I : II : I llh I I - : • I 

Db 1058 KLVKKQEQEVKTEAEKVI LEKVKETLSMEKI PPMVTTDQKQEES - KLEKDKASALQEKKP 1116 

Qy 757 QHKEERLSAS PQEL GKPYLESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTA 812 

:|-| ^ II II H I I = I I h 
Db 1117 LPEEKKLI PEEEKI RSEEKKPLLEEKKPTPEDKK LLPEAKTSAPEEQ 1163 

Qy 813 I YSNDDLLSSK EDKI KESETFSDSSPIEI IDEFPTFVSAKDDS- 855 

Db 1164 KHDLLKSQVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLP SGTPQSLPKEDDKT 1216 

Qy 856 PKLAKE YTDLEVSDKSE I AN I Q SGADSLPCLELPCD 891 

I II = I llh - I I I hi 

Db 1217 TKTIKEQPQPPCTAKPDQEKE-DDKSDTSSSQQPKSPQGLSDTGYSSDGISSSLGEIP-- 1273 

Qy 8 92 LSFKNI YPKDEVHV SDEFSENRSSVSK ASISPSNVSALEPQTEMGSIVKSKS 943 

- I II : III: i I I : : =1 II I ■ lh 

Db 1274 SLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSEK 132 9 

Qy 944 LTK- - EAEKKLPSDTEKEDRSLSAVLSAELS KTS V 976 

Db 1330 KTQPHEVSPEQPKDQEK-TQSLSETLEITI SEEEI 1363 



RESULT 11 
PCLO_CHICK 

ID PCLO_CHICK STANDARD; PRT; 5120 AA . 

AC Q9PU3 6; 

DT 28-FEB-2003 (Rel . 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Piccolo protein (Aczonin) (Fragment) . 

GN PCLO OR ACZ. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gall i formes ; Phasianidae,* Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=10508862 ; 

RA Wang X., Kibschull M. , Laue M.M. , Lichte B., Petrasch-Parwez E . , 

RA Kilimann M. W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profiling; 

RL J. Cell Biol. 147:151-162(1999). 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 



CC organization of synaptic active zones and in synaptic vesicle 

CC trafficking (By similarity) . 

CC -!- SUBUNIT: Interacts with Rabacl/Pral and profilin (By similarity). 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 
CC synaptic junctions (By similarity) . 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y19187; CAB60725.1; 

DR HSSP; P04410; 1A25 . 

DR GO; GO: 0045202; C: synaptic junction; ISS. 

DR GO; GO:0005509; F:calcium ion binding activity; ISS. 

DR GO; GO:0005544; F : calcium-dependent phospholipid binding acti. . .; ISS. 

DR GO; GO: 0005522; F:profilin binding activity; ISS. 

DR GO; GO: 0007010; P : cytoskeleton organization and biogenesis; ISS. 

DR GO; GO:0016080; P:synaptic vesicle targeting; ISS. 

DR InterPro; IPR000008; C2 . 

DR InterPro; IPR001478; PDZ . 

DR InterPro; IPR001565; Synaptotagmin . 

DR Pfam; PF00168; C2 ; 2 . 

DR Pfam; PF00595; PDZ; 1. 

DR PRINTS; PR003 99; SYNAPTOTAGMN . 

DR SMART; SM0023 9; C2 ; 2. 

DR SMART; SM0022 8; PDZ; 1. 

DR PROSITE; PS004 99; C2JDOMAIN_l ; 1. 

DR PROSITE; PS50004; C2_DOMAIN_2 ; 2. 

DR PROSITE; PS50106; PDZ; 1. 

KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger; 

KW Repeat . 

FT NON_TER 1 1 

FT DOMAIN 258 357 10 X 10 AA TANDEM APPROXIMATE REPEATS OF 

FT P-A-K-P-Q-P-Q-Q-P-X. 

FT ZN__FI-NG 368 3 92 C4-TYPE (POTENTIAL) . 

FT ZN_FING 836 859 C4 -TYPE (POTENTIAL) . 

FT DOMAIN 2324 2343 POLY-PRO. 

FT DOMAIN 4414 4493 PDZ. 

FT DOMAIN 4627 4726 C2 DOMAIN 1. 

FT DOMAIN 5003 5094 C2 DOMAIN 2. 

SQ SEQUENCE 5120 AA; 560751 MW; A658D98 91B65B4 12 CRC64 ; 

Query Match 5.1%; Score 300; DB 1; Length 512 0; 

Best Local Similarity 20.4%; Pred. No. 0.0014; 

Matches 258; Conservative 152; Mismatches 464; Indels 388; Gaps 56; 

Qy 14 TDSPPRPPPAFKYQFVTEPEDEEDEEE EEDEEEDDEDLEELEVLERKPAAGLSAA 68 

HI I : I Ih I : Ih ::::::: :: | |: 



Db 90 SDSDAAHEEAGRKQKVTQKEQGKPEEQRGLAKHPSQQQSPKLVQQQGPVKPTPQQTESSK 14 9 



Qy 69 AVP PAAAAPLLDFSSDSV- - PPAPRGP LPAAPPAAPERQPSWE 109 

M I : I I := I II MM 

Db 150 PVPQQQQPGEPKQGQKPGPSHP-GDSKAEQVKQPPQPRGPQKSQLQQSEPTKPGQQQTSA 2 08 

Qy 110 RSPAAPAPSLP- -PAAAVLPSKLPEDDEP PAR - P 140 

- I I II I =1 hi : I III I 

Db 2 09 KTSAGPTKPLPQQPDSAKTSSQAPPPTKPSLQQSGSVKQPSQQPARQGGPVKPSAQQAGP 268 

Qy 141 PPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDET- -LFALPAASEPVI PSS- -AEKIMD 196 

II I i i I ■ I 

Db 26 9 PKQQPGSEKPTAQQTGPAKQPPQP-GPGKTPLQQTGPVKQVPPQAGPTKPSSQTAGAAKS 327 

Qy 197 LMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEET 256 

I MM I I I : I I : III = = | 

Db 328 LAQQPGLTKPPGQQPGPEKPLQQKQA STTQPVESTPKKT 366 

Qy 257 L NEASKELPERAT NPFVNRDLAEFSE LEYS 286 

I fll II I : I I 

Db 367 FCPLCTTTELLLHTPEKANYNTCTQCHTWCSLCGFNP- -NPHITEI KEWLCLNCQMQRA 424 

Qy 287 EMGSSFKGSPKGESAILVENTKEEVI VRSKDKEDLVCSAALHSPQESPVGKEDRVVSPE- 345 

III = I I I I III I hi h 

Db 425 LGGDLASGHGPGPQ LPPPKQKTPTPASTAKPSPQLQPGQKKDASPKPDP 473 

Qy 346 KTMDIFNEMQMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESK 398 

I ' ■ I I I : ' :\ : :: \: \ \ :\ 
Db 474 SQQADSKKPVPQKKQPSMPGSPPVKSKQTHAEPSDTGQQI -DSTPKSDQVKPTQA 527 

Qy 3 99 VDRKCLEDSLEQKSLGKD SEGRNEDASFPSTPEPVKDSSRAYITCASFTSA 44 9 

h I h I : | :| : | :| |:: :| : 

Db 528 EEKQNQPSIQKPTMDTVPTSAAPGVKQDLADPQSP STQQKVTDS PMPET 576 

Qy 450 TESTTANTFPLLEDHTSENKTDEKKIEERKAQI ITEKTSP KTSNPFLV 4 97 

M I : I : =| I | : |= :|: | Ml 

Db 577 TK PPADTHPAGDKPDSKPL PQVSRQKSDPKLASQSGAKSDAKTQKPSEP 625 

Qy 4 98 A-VQDSEADYVT TDTLSKVTEAAVSNMPE 52 5 

I hi I I : I I : h 

Db 626 APVKDDPKKLQTKPAPKPDTKPAPKGPQAGTGPRPTSAQPAPQPQQPQKTPEQSRRFSLN 685 

Qy 526 --GLT PDLVQEACESEL NEA TGTKIAYETKVDLVQTSEAIQ 564 

hi I II :| -I | : :\ :: | 

Db 686 LGGITDAPKPQPTTPQETVTGKLFGFGASIFSQASSLISTAGQPGSQTSGPAPPATKQPQ 745 

Qy 565 ESLYPTAQLCPSFEEAEATPSP-VLPDI VMEAPLNS- -LLPSAGASWQPSVSPLEAPPP 621 

M I I hi I I I M I I I I I : Ml I 

Db 74 6 PPSQPPASQAPPKEAAQAQPPPKAAPTKKETKPLASEKLGPMASDSTLTTKGSDLEKKPS 8 05 

Qy 622 VSYDSI KLEPENPPPYE EAMMVALKALGTKEG I KE P ES FNAA 663 

Db 806 LAKDSKHQTAEAKKPAELSEQEKASQPKVSCPLCKTGLNIGSKDPPNFNTCTECKKWCN 865 

Qy 664 VQETEAPYISIAC DLIK 680 

Db 866 LCGFNPMPHIVEVQE WLCLNCQTQRAMSGQLGDMGKVPLPKLGPSQPVSKPPATPQ 921 



Qy 


681 


ETKLSTEPSPDFSNYSEIAKFEKSVP EHAEL -VEDSS 

1 1 1 1 ' I • 1 1 1 1 • ■ 1 


716 


Db 


922 


I I I I * 1 1 111**1 
KQPVPAVSHSPQKSSTPPTPAATKPKEEPSVPKEVPKLQQGKLEKTLSADKIQQGIQKED 


981 


Qy 


717 


PESEP VDLF - - - SDDS I PEVPQTQEEAVMLMKESLTEVS ETVAQH - KEERLSAS PQELGK 


772 


Db 


982 


:h II 1 1 1 1 | - : | : | |= = :=| : :| | 
AKSKQGKLFKTPSADKIQRVSQKEDSRLQQTKLTKTPSSDKILHGVQKEDIKFQEAKLAK 


1041 


Qy 


773 


PYLESFQPNLHSTKDA- - - ASNDI PTLTKKEKI SLQMEEFNTAI YSN 

1 . II II ... 1 .M.ii t 


816 


Db 


1042 


1 • II II * * * 1 * 1 1 - M : | : : : 
I PSADKILHRLQKEDPKLQQMKMAKALSADKIQPEAQKEDVQLQEVRLSKAVSADKIQHG 1101 


Qy 


817 


- - DDL LSSKEDKI KESETFSDSSPI EI I -DEFPTFVSA 


851 


Db 


1102 


M H Ml !'h 1 1 : h | ||: 
IQKDLNLQHVKIEKTSSVEKIQEAQKESKLQQDKLPKTLSEDKIPATVSSDHKKLLSKSE 


1161 


Qy 


852 


KDDSPKIAKEYTDLEVSDKSEIANIQSG- -ADSLPCLELPCDLSFKNI YPKDEVHVSDE- 


908 


Db 


1162 


: 1 hi == 1 1 =1 =1 = : 1 III 1 : 1 1 

EDKKPELLEKSTPHPKDKKEQI TAETTGHI TEQKVEVEAPCD KLHEKKQEDVKKED 


1217 


Qy 


909 


FSENRSSVSKA- -SISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 

^ 1 II : 1 III h- h 1 : II H III 


962 


Db 


1218 


LTTGI PQMVSKPEKAEEEKTPVPVSRL-PRSDHVEAVREK-IEKEDDK SDTSSSQQQ 


1272 


Qy 


963 


LS 964 




Db 


1273 


KS 12 74 





RESULT 12 
CPN DROME 



ID CPN_DROME STANDARD; PRT; 865 AA. 

AC Q02910; 

DT 01-OCT-1993 (Rel . 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Calphotin. 

GN CPN" OR CAP . 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophil idae; Drosophila. 

OX NCB-I_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165729; PubMed=8 0 94 55 9 ; 

RA Martin J.H., Benzer S., Rudnicka M. , Miller C.A. ; 

RT "Calphotin: a Drosophila photoreceptor cell calcium-binding protein.' 1 ; 

RL Proc. Natl. Acad. Sci . U.S.A. 90:1531-1535(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S ; 

RX MEDLINE=93165730; PubMed=8434 015 ; 

RA' Ballinger D.G., Xue N. , Harshman K.D.; 

RT "A Drosophila photoreceptor cell -specif ic protein, calphotin, binds 



RT calcium and contains a leucine zipper."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1536-1540(1993). 

CC -!- FUNCTION: Might function as a calcium-sequestering "sponge" to 
CC regulate the amount of free cytoplasmic calcium. It binds 0.3 mole 

CC of Ca(2+) per mole of protein. 

CC -!- SUBUNIT: Homodimer (Probable). 

CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC; HYPODENSE COMPARTMENT. 

CC -!- TISSUE SPECIFICITY: SOMA AND AXONS OF PHOTORECEPTOR CELLS OF 

CC COMPOUND EYES AND OCELLI . 

CC -!- DEVELOPMENTAL STAGE: EXPRESSED EARLY IN PHOTORECEPTOR CELL 
CC DEVELOPMENT. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L02111; AAA28405.1; 

DR EMBL; L05080; AAA28420.1; -. 

DR PIR; A47282; A47282 . 

DR PIR; A47283; A47283 . 

DR FlyBase; FBgn0010218; Cpn. 

DR GO; GO: 0005509; F: calcium ion binding activity; IDA. 

KW Calcium-binding. 



FT 


CONFLICT 


36 


36 


A - > AVAPAWA (IN REF . 2) 


FT 


CONFLICT 


43 


43 


I -> T (IN REF. 


2) . 


FT 


CONFLICT 


64 


64 


I -> V (IN REF. 


2) . 


FT 


CONFLICT 


76 


76 


T -> A (IN REF . 


2) . 


FT 


CONFLICT 


100 


100 


P -> PP (IN REF. 


2) . 


FT 


CONFLICT 


126 


127 


VQ -> AP (IN REF. 2) . 


FT 


CONFLICT 


154 


154 


I -> V (IN REF. 


2) . 


FT 


CONFLICT 


160 


160 


S -> T (IN REF . 


2) . 


FT 


CONFLICT 


534 


534 


A -> E (IN REF. 


2) . ' 


FT 


CONFLICT 


699 


699 


I -> T (IN REF. 


2) . 


FT 


CONFLICT 


703 


703 


V -> L (IN REF. 


2) . 


FT 


CONFLICT 


721 


721 


D - > E (IN REF. 


2) . 


SQ 


SEQUENCE 


865 AA; 


84781 


MW; 2110417E0B0E7CFE CRC64 ; 



Query Match 5.1%; Score 2 95.5; DB 1; Length 865; 

Best Local Similarity 21.3%; Pred. No. 0.00021; 

Matches 217; Conservative 127; Mismatches 379; Indels 295; Gaps 41; 
Qy 62 AAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPER QPSWERSPAAPAP 117 

: l IMIh I HIM I III lh Mill 

Db 11 SAPVAAPVTPSAVAAPVQWSPAAVAPAPAAPIAVTPVAPPPTLASVQPATVTIP-APAP 69 
Qy 118 SLPPAAAVLPSKLPEDDEPPARPPPPPPAGA SPLAEPAAPPSTP 161 

IM hi I : Ml Ml Ml I I II 

Db 70 IAAASVTPVASVAPPWAAPTPPAASPVSTPVAVAQIPVAVSAPVAPPVAATPTPWQIP 129 

QY 162 -AAPKRRGSGSVDETLFALP- -AASEP VI PSSAEKIMDLMEQPGNT VSSGQED 211 

Ml M I III I I I : I | M 

Db 13 0 VAAP VIATPPVAASAPTPAAVTPVISPVIASPPWPANTTVPVAAPVAA 178 



212 FPSVLLETAASL-PSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATN 270 
179 VPAAVPWAPVLAPAVAPAVAPWAETPAPPPVAEIPVA--TIPECVAPLIPEVSW^ 236 
271 PFVNRDIAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSP 330 

I I I 

237 PLA AAEPVWAPPATETPWAPAAASP 263 

331 QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLA 3 90 

I hi = = :|IMI 

2 64 HVS VAP AVETAWAPV 27 9 

391 ARANVESKVDRKCLEDSLEQKSL GKDSEGRNEDASFPSTPEPVKDSSRAYITCASF 446 

h I I I = I =| : I = I II I :: 

280 -SASTEPPVAAATLTTAPETPALAPWAESQVAANTWATPPTPAPEPETIAPPWAETP 338 

447 TSATESTTANTFPLLEDHTSENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQDSEADY 506 
339 EVASVAVAETTPPWPPVAAES 1 PAP WATT PVPATLAVTDPD 381 

507 VTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES 566 

II =11 =1 = I I I II Ml 
382 VTASAVPELPPVIAPSPVPSA VAETPVDLA PPV 414. 

567 LYP -TAQLCPSF - - EEAEATPS PVLPDI VMEAPLNS LLP SAGAS WQ PS VS PLEA P 619 

I I h h II M i : : I h :| : || :| ||| 

415 LPPVAAEPVPAWAEETPETPAPASAPVTI -AALD- - 1 PEVAPVIAAPSDAPAEAPSAAA 471 

620 PPVSYDS I KLEPENPPPYEEAMNVALKALGTKEGI KEPESFNAAVQETEAPYI SIACD- - 677 

I M II hi II II - : 

472 PIVS TPPT TASVPETTAPPAAVPTEPI 498 

678 LIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAE- -LVEDSSPESEPVDLFSD-DSIP 731 

: : I : I : I =hl = : II I -| I : II h| 

499 DVSVLSEAAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDLLEQTTSVP 558 

732 EVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHSTKDAASN 791 

I : : : : II H II - :| =11 :|:| : : 

559 AVEAAESTSSPI PETSLPPPNEAVA SPEVAVAPITAPEPI PEP-EPSLATPTEPI PV 614 

792 DI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI EI IDEFPTFVSA 851 

' I := | :: | : :| | ||| |: 

615 EAPV VIQEAVDAVEVPVTETSTSI PETTVEFPEAVAE 651 

852 KDDS PKLAKE YTDLE V- SDKS E IAN I QSGAD SLPCLEL PCDLSFKNIYP- 899 

11= 1= I : = = = 111 II = I =1= I- | | 

652 KVLDPAI TEAPVTTQEPDVANINDGAPATEITTPAVEI VTAAAEVSDIAIPVIDPP 707 

900 -KDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMG 936 

h h I I : - : i : I h 

708 VPQEIAVA-EI PETDTKPAEVIVEQSTI PI EAPVPEVSKYAEPVI SEAPAAEVPITAGDN 766 

937 SIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGV 990 

h h = III = | | : | : = || ||:: | | 

767 PDNTSVGISEWPTIAEKPVEEVPTSEIPEQSSSPSDSVPVAKITPLL- -RDLQTTDV 822 



RESULT 13 
MAPB RAT 



ID MAPB_RAT STANDARD; PRT; 24 5 9 AA . 

AC P15205; Q62958; Q9ER21; Q9QW92 ; 

DT 01-APR-1990 (Rel . 14, Created) 

DT ■ 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) (Neuraxin) [Contains: MAPI 

DE light chain LCI] . 

GN MAP1B . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE OF 1-142 FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE^Testis ; 

RX MEDLINE=96257242; PubMed=8 6662 95 ; 

RA Liu D. f Fischer I.; 

RT "Isolation and sequencing of the 5 T end of the rat microtubule- 

RT associated protein (MAP1B) -encoding cDNA." ; 

RL Gene 172:307-308(1996). 

RN [2] 

RP SEQUENCE OF 96-2459 FROM N.A. , DOMAIN, AND INDUCTION. 

RC STRAIN=Sprague-Dawley; TISSUE=Brain, and Glial tumor; 

RX MEDLINE=92347374; PubMed=163 9092 ; 

RA Zauner W. , Kratz J., Staunton J., Feick P., Wiche G.; 

RT "Identification of two distinct microtubule binding domains on 

RT recombinant rat MAP IB."; 

RL Eur. J . Cell Biol. 57:66-74(1992). 

RN [3] 

RP SEQUENCE OF 1541-2459 FROM N.A., AND TISSUE SPECIFICITY . 

RC TISSUE=Spinal cord; 

RX MEDLINE=9005 98 71; PubMed=255515 0 ; 

RA Rienitz A., Grenningloh G., Hermans -Borgmeyer I., Kirsch J., 

RA Littauer U.Z., Prior P., Gundel finger E.D., Schmitt B., Betz H.; 

RT "Neuraxin, a novel putative structural protein of the rat central 

RT nervous system that is immunologically related to microtubule- 

RT associated protein 5."; 

RL EMBO J. 8:2879-2888(1989). 

RN [4] 

RP DEVELOPMENTAL STAGE, AND PHOSPHORYLATION. 

RX MEDLINE=97405699; PubMed=926 0743 ; 

RA Ma D . , - Nothias F., Boyne L.J., Fischer I . ; 

RT "Differential regulation of microtubule-associated protein IB (MAP1B) 

RT in rat CNS and PNS during development."; 

RL J. Neurosci. Res. 49:319-332(1997). 

CC -!- FUNCTION: The function of brain MAPS is essentially unknown. 
CC Phosphorylated MAP1B may play a role in the cytoskeletal changes 

CC that accompany neurite extension. Possibly MAP1B Binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LCI, LC2 and LC3 , can associate 
CC with MAP1A and MAP1B proteins. 

CC -!- TISSUE SPECIFICITY : Nervous system (spinal cord, brain stem, 

CC cerebellum and cerebrum) . Not expressed in liver, spleen, kidney, 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
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cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



heart or muscle. 

DEVELOPMENTAL STAGE : In cerebral cortex, spinal cord and sciatic 
nerve levels are high early in development but decrease during 
postnatal development and are low in adults. In dorsal root 
ganglia levels remain high throughout development. 
INDUCTION: By nerve growth factor. 

DOMAIN: Has a highly basic region with many copies of the sequence 
KKEE and KKEl/V, repeated but not at fixed intervals, which is 
responsible for the binding of MAP1B to microtubules. 
PTM: LCI is coexpressed with MAP1B. It is a polypeptide generated 
from MAP1B by proteolytic processing. It is free to associate with 
both MAPI A and MAP1B. It interacts with the amino -terminal region 
of MAP1B (By similarity) . 
PTM: Phosphorylated. 
SIMILARITY: TO MAP1A . 

CAUTION: A C-terminal fragment of this protein (residues 1597 to 
2459) was originally described as neuraxin in Ref.3. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 
the European Bioinf ormat ics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; U52950; AAB17068.1; 
EMBL; X60370; CAC16162.1; -. 
EMBL; X16623; CAA34620.1; ALT_SEQ . 
PIR; A56577; A56577. 

InterPro; IPR000102; MAPlB_neuraxin . 
Pfam; PF00414; MAPlB_neuraxin ; 10. 
PROSITE; PS00230; MAP 1 B_NEURAX I N ; 8. 
Microtubules ; Repeat ; Phosphorylat ion . 
CHAIN ? 

REPEAT 1869 1885 MAP1B 1. 

REPEAT 1886 1902 MAP1B 2. 

REPEAT 1903 1919 MAP1B 3. 

REPEAT 1920 1936 MAP1B 4. 

REPEAT 1937 1953 MAP1B 5. 

REPEAT 1954 1970 MAP1B 6. 

REPEAT 1988 2004 MAP1B 7. 

REPEAT 2005 2021 MAP1B 8. 

REPEAT 2022 2 03 8 MAP IB 9. 

REPEAT 2039 2055 MAP1B 10. 

DOMAIN 559 
DOMAIN 588 

2224 
127 
140 
2112 
2169 



DOMAIN 
CONFLICT 
CONFLICT 
CONFLICT 
CONFLICT 
SEQUENCE 



2459 
1885 
1902 
1919 
1936 
1953 
1970 
2004 
2021 
2038 
2055 
1035 
786 

2312 
127 
140 
2112 
2169 



MAPI LIGHT CHAIN LCI 



GLU-RICH. 

LYS-RICH (HIGHLY BASIC, CONTAINS MANY 
KKEE AND KKEl/V REPEATS) . 
LYS-RICH. 

M -> V (IN REF. 1) . 
T - > S (IN REF. 1) . 
R -> K (IN REF. 3) . 
L -> I (IN REF. 3) . 



2459 AA; 269497 MW; 2E3F6872DEDB8BA2 CRC64 ; 



Query Match 

Best Local Similarity 



5.0%; Score 291.5; DB 1; Length 2459; 
20.0%; Pred. No. 0.0011; 



Matches 220; Conservative 167;. Mismatches 456; Indels 255; Gaps 44; 

Qy 3 0 TEPEDEEDEEEEEDEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPA 8 9 

: I I IN- II hi : I I I | || | : : I : 

Db 1008 SEEEGEEEEDKAEDAREEDHEPDKTE AEDYVMAWDKAAEAGVTEDQYDFL 1058 

Qy 90 PRGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGAS 14 9 

I : I I :|| I!-: II I I II 

Db 1059 GTPAKQ PGVQSPSREPASSIHDETLPGGSESEAT AS 1094 

Qy 150 PLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGN TVS 206 

I I III: :||| hi | | | 

Db 1095 DEENREDQPEEFTAT SGYTQST IEISSEPTPMDEMSTPRDVMTDETNNEETES 1147 

Qy 2 07 SGQE DFPSVLLETAASLP SLSPLS TVSFKEHGYLGNLSAVSSSEGTIE 254 

II : I I II h II I : I : I :| I 

Db 1148 PSQEFVNITKYESSLYSQEYSKPWASFNGLSDGSKTDATDGRDYNASASTISPPSSMEE 1207 

Qy 255 ETLNEAS KELPERATNPFVNRDLAEF- -SELEYSEMG 289 

: :::: |:: : :|. : |: | :| : ;| 

Db 12 08 DKFSKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERS 1267 

Qy 290 SSFKGSPKGESAILVENTKEEVIVR SKDKEDL VCSAALHSP 330 

Db 1268 WFSLTPNEIKASAEGEATAWSPGVTQAWEEH 1327 

Qy 331 -QESPVGKEDRWSPEKTMDIFNEMQMSWAPVREEYADFK PFEQAWEVKD 38 0 

:\\ : \ \ = = || |: : | | == | | 

Db 1328 YYQSPTDEKSSHLPTEVT ENAQAVPVS FEFTEAKDENERSS I S PMDE - - PVPD 1378 

Qy 381 TYEGSRDVLAARANVESKVDRKCLED- -SLEQKSLGKDS EGRNEDASFPSTPEPVK 434 

lh = III: h I I : I Ihl I || 

Db 1379 SESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSDKESPVS 1438 

Qy 435 DSSRAYITCASFTSATESTTANTFPLLEDHTSENKT DEKKI 475 

I : I I I h M = I I Ihh 

Db 1439 D LTSDLYQDKQEEKRAGF I PI KEDFS PEKKASDAEI MSSQSALALDERKLGGDGS 1493 

Qy 4 76 EERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAV- - 52 0 

h I I I I : : I :h I = == h hi 
Db 1494 PTQVDVSQFGSFKEDTKMS I SEGTVSDKSATPVDEGAEDT YSHMEGVASVSTASVAT 1550 

Qy 521 SNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQES-LYPTAQLCP S 576 

h II III : :h | = = I UN lh = h = II I 

Db 1551 SSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMS 1609 

Qy 577 FEEAEATP SPVLPDIVMEAPLNSLL PSAGASWQ 610 

-I \ ■■ ■■ hi :|| h II - 

Db 1610 ISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLANDFSRQSPDHPTVGAGMLH 1669 

Qy 611 PSVSPLEAPPPVSYD S I KLEPENPPPYEEAMNVALKALGTKEGI KEPESFN 661 

III | |: | | | :::::: : | : : 

Db 1670 I TEN GPTEVDYSPSDIQDSSLSHKI PPTEEPSYTQDNDLS-ELISVSQVEASPSTSS 1725 

Qy 662 AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 721 

I II I ::| I I I h H : I ■ I Ihh 

Db 1726 AHTPS QIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKL- - - -SPKSDI 1775 



Qy 722 VDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKKERLSASPQELGKPYL-ESFQP 780 

I : I I - III I :| : : =| | | - I 

Db 1776 SPLTPRESSPTYSPGFSDSTSGAKES-TAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQH 1834 

QY 781 NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI E 840 

Db 1835 HLALSRDLTTSSV EKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYE - - SHEKTI Q 1888 

QY 841 IIDEFPTFVSAKDDSPK LAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKN 896 

I = = : I I :| : h : I I |: = : I 

Db 1889 AHDVGGYYYEKTERTI KS PCDSGYS YETI EKTTKTP EDGGYS CEITEKT 1937 

QY 897 IYPKDEVHVSDEFSENRSSVSKAS ISPSNVSALEPQTEMGSIVKSKSL 944 

U I I II : : I : : = = | : | 

Db 1938 TRTPEEGGYSYEI SEKTTRTPEVSGYTYEKTERSRRLLDDI SNGYDDTEDGGHTLGDCSY 1997 

Qy 94 5 TKEAEKKLPSDTEKEDRS 962 

= I H= I I I I 

Db 1998 SYETTEKITSFPESESYS 2015 

RESULT 14 
MAPB_HUMAN 

ID MA PB__HUMAN STANDARD; PRT; 24 68 AA. 

AC P46821; 

DT 01-NOV-1995 (Rel . 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) [Contains: MAPI light chain 

DE LCI] . 

GN MAP1B . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal brain; 

RX MEDLINE=95104835; PubMed=78 062 12 ; 

RA Lien L.L., Feener C, Fischbach N., Kunkel L.M.; 

RT "Cloning of human microtubule-associated protein IB and the 

RT identification of a related gene on chromosome 15 . *' ; 

RL Genomics 22:273-28 0(1994). 

CC -!- FUNCTION: THE FUNCTION OF BRAIN MAPS IS ESSENTIALLY UNKNOWN. 

CC PHOSPHOR YLATED MAP IB MAY PLAY A ROLE IN THE CYTOSKELETAL CHANGES 

CC THAT ACCOMPANY NEURITE EXTENSION . POSSIBLY MAP1B BINDS TO AT LEAST 

CC TWO TUBULIN SUBUNITS IN THE POLYMER, AND THIS BRIDGING OF SUBUNITS 

CC MIGHT BE INVOLVED IN NUCLEATING MICROTUBULE POLYMERIZATION AND IN 

CC STABILIZING MICROTUBULES. 

CC -!- SUBUNIT: 3 DIFFERENT LIGHT CHAINS, LCI, LC2 AND LC3 , CAN ASSOCIATE 

CC WITH MAP1A AND MAP1B PROTEINS. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 

CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LCI is coexpressed with MAP1B . It is a polypeptide generated 

CC from MAP1B by proteolytic processing. It is free to associate with 



CC both MAP1A and MAP1B . It interacts with the amino- terminal region 

CC of MAP1B (By similarity) . 

CC -!- SIMILARITY: TO MAP1A. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; L06237; AAA18904.1; -. 

DR Genew; HGNC:683 6; MAP IB. 

DR MIM; 157129; -. 

DR GO; GO: 0005875; C : microtubule associated complex; TAS . 

DR InterPro; IPR000102; MAPlB_neuraxin . 

DR Pfam; PF00414; MAPlB_neuraxin ; 10. 

DR PROSITE; PS00230; MAP1B_NEURAXIN; 6. 

KW Microtubules; Repeat; Phosphorylation. 



FT 


CHAIN 




2468 


MAPI LIGHT CHAIN LCI. 


FT 


REPEAT 


1878 


1894 


MAP1B 


1 . 


FT 


REPEAT 


1895 


1911 


MAP1B 


2 . 


FT 


REPEAT 


1912 


1928 


MAP1B 


3 . 


FT 


REPEAT 


1929 


1945 


MAP1B 


4 . 


FT 


REPEAT 


1946 


1962 


MAP1B 


5. 


FT 


REPEAT 


1963 


1979 


MAP1B 


6. 


FT 


REPEAT 


1997 


2013 


MAP1B 


7 . 


FT 


REPEAT 


2014 


2030 


MAP1B 


8 . 


FT 


REPEAT 


2031 


2047 


MAP1B 


9. 


FT 


REPEAT 


2048 


2064 


MAP1B 


10. 


FT 


DOMAIN 


589 


790 


LYS-RICH {HIGHLY BASIC, CONTAINS MANY 


FT 








KKEE AND KKEI/V REPEATS) . 


SQ 


SEQUENCE 


2468 


AA; 270618 


MW; 540839CBDF09D461 CRC64 ; 


Query Match 




4.9%; 


Score 


288; DB 1; Length 2468; 


Best Local , 


Similarity 20.2%; 


Pred . 


No. 0.0015; 



Matches 251; Conservative 166; Mismatches 441; Indels 386; Gaps 56; 



Qy 


13 


Db 


625 


Qy 


51 


Db 


685 


Qy 


111 


Db 


727 


Qy 


169 


Db 


783 



II I 



PA- 



= I M =1 

KDSVA AGKPKEKGKI 782 

-ASEPVI PSSAEKIMDLME 199 
I - I : II 



Qy 



2 00 QPGNTVSSGQEDFPSV-LLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLN 258 



843 LKAEEVDVTKDI KPQLELI EDEEKLKETEPVEAYVI QKEREVTKGPAES PDEG - ITTTEG 901 

259 EASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTK-EEVIVRSKD 317 

II I = = = = I : h h I h | : || :| 

902 EGECEQTPEELEPVEKQGVDDI EKFE - - DEGAGFEES - - S ETGDYEEKAETEEAEEPEED 957 

318 KEDLVC - SAALHS P QES P VGKEDR WS PEKTMD I FNEMQMS V 358 

1= II 11= Ml HI Ml hil l 
958 GEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRA EEDMD EAIEKG 1011 

359 VAPVREEYADFKPFEQAWEVK-DTYEGSRDVLAARANVESKVDRKCLEDSLEQ 410 

I II II : -I : : : || : : I I : lh |: 
1012 EAEQSEEEADEE- -DKAEDAREEEYEPEK- -MEAEDYVMAWDKAAEAGGAEEQYGFLTT 1067 

411 - - KSLGKDSEGR NEDASFPSTPEPVKDSSRAYITCASFTSATESTT 454 

I II I M =1 : I I :| : : :| :| : 

1068 PTKQLGAQS PGREPASS I HDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTI EI S 1127 

455 ANTFPL LEDHTSENKTDEK KI EERKAQI I TEKTS - PKTSNPFLVAV 499 

' 1 = : | h =|: | ::::::: | | | 

1128 SEPTPMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPADVTPLNGFS 1187 

500 QDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDL 556 

118 8 EGS KTD - - ATDG KD YNAS AS TISPPSSMEED KFSRSALRDAYCS E VKASTTLD I KDS 1242 

557 - - VQTSEAI QESLYPTAQLCP SFE EAEATPSPVLPDI V 592 

= 11 : I h I :| ||| :|| |:: 

1243 ISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFSLTPNEIKVSAEAEVAPVSPEVT 13 02 

5 93 ME APLNSLLPSAG ASWQ 610 

I =1 h III |:: 

13 03 QEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSPTDEKSSHLPTEVIEKPPAVPVS 1362 

6H PSVSPLEAPPPVSYDSIK- -LEPENPPP YE EAMN VALKALG 649 

I I I I : : IN h II II || :: || | 

1363 FEFSDAKDENERASVSPMDEPVPDSESPIEKVLSPLRSPPLIGSESAYESFLSADDKASG 1422 

650 TKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPD-FSNYSEIA KF 701 

: |:|: 11 : I I I I "11= I 

1423 RGAESPF EE KS GKQG S P DQ VS P VS EMTS TS LYQD KQ 1458 

702 EKSVPEHAELVEDSSPE SEPVDLFSDDSIPEVPQTQ EEAV 741 

I : I : II I |:| : : :| || |: 

1459 EGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPTQIDVSQFGSFKEDTK 1518 

742 MLMKE SLTEVSETVAQ HKEERLSASPQELG KPYLESFQPNLHSTK 786 

I : I I I I I lh || || : :| : |:|| 

1519 MSISEGTVSDKSATPVDEGVAEDTYSHMEGVASVSTASVATSSFPEPTTDDVSPSLH 1575 

787 DAASNDI PTLTK-KEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPI EI I DEF 845 
I I h : :|: : : | : :: |||: = 
1576 - -AEVGSPHSTEVDDSLSVS WQTPTT- FQETEMSPSKEECPR 1615 

84 6 PTFVSAKDDSPKLAKEYTDLE- -VSDKSEIANIQSGADSLPCLELPCDLSFKN 8 96 

I 1 I I Ml M I - h:| = :|= | ■ =| | | | | :: 

1616 PMSISPPDFSPKTAKSRTPVQDHRSEQSSM-SIEFGQES-PEQSLAMDFSRQSPDHPTVG 1673 



Qy 897 IYPKDE-VHVSDEFSENRSSVSKASISPSNV 926 

I I :| : I III: III 

Db 1674 AGVLHITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTS 1733 

Qy 927 SALEPQTEMGS I VKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAE 970 

II I | :: :|: | : | || | |::| 

Db 1734 SAHTP-SQIASPLQEDTLSDVAPPR- -DMSLYASLTSE 1768 

RESULT 15 
MAPB__MOUSE 

ID MAPB_MOUSE STANDARD; PRT; 24 64 AA . 

AC P14873; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) (MAPI. 2) (MAPI (X) ) 

DE [Contains: MAPI light chain LCI] . 

GN MAP1B OR MTAP1B OR MTAP5 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND DOMAIN. 

RC STRAIN=Swiss Webster; TISSUE=Brain ; 

RX MEDLINE=90094539; PubMed=248 0963 ; 

RA Noble M., Lewis S.A., Cowan N.J.; 

RT "The microtubule binding domain of microtubule-associated protein 

RT MAP1B contains a repeated sequence motif unrelated to that of MAP2 

RT and tau. " ; 

RL J . Cell Biol. 109:3367-3376(1989). 

CC -!- FUNCTION: THE FUNCTION OF BRAIN MAPS IS ESSENTIALLY UNKNOWN. 
CC PHOSPHORYLATED MAP1B MAY PLAY A ROLE IN THE CYTOSKELETAL CHANGES 

CC THAT ACCOMPANY NEURITE EXTENSION. POSSIBLY MAP1B BINDS TO AT LEAST 

CC TWO TUBULIN SUBUNITS IN THE POLYMER, AND THIS BRIDGING OF SUBUNITS 

CC MIGHT BE INVOLVED IN NUCLEATING MICROTUBULE POLYMERIZATION AND IN 

CC STABILIZING MICROTUBULES. 

CC -!- SUBUNIT: 3 DIFFERENT LIGHT CHAINS, LCI, LC2 AND LC3 , CAN ASSOCIATE 
CC WITH MAP1A AND MAP1B PROTEINS. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 

CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP IB to microtubules. 

CC -!- PTM: LCI IS COEXPRESSED WITH MAP1B. IT IS A POLYPEPTIDE GENERATED 

CC FROM MAP1B BY PROTEOLYTIC PROCESSING. IT IS FREE TO ASSOCIATE WITH 

CC BOTH MAP1A AND MAP1B . IT INTERACTS WITH THE AMI NO -TERMINAL REGION 

CC OF MAP1B. 

CC -!- SIMILARITY: TO MAPI A . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



cc 

DR EMBL; X51396; CAA35761.1; -. 

DR PIR; S07549; QRMSP1 . 

DR MGD; MGI: 1306778; Mtaplb. 

DR GO; GO:0016358; P: dendrite morphogenesis; IMP. 

DR GO; GO: 0001578; P : microtubule bundling; IMP. 

DR InterPro; IPR000102; MAPlB_neuraxin. 

DR Pfam; PF00414; MAPlB_neuraxin ; 10. 

DR PROSITE; PS0023 0; MA P 1 B__NEURAX I N ; 7. 

KW Microtubules; Repeat; Phosphorylation. 



FT 


CHAIN 




2464 


MAPI LIGHT CHAIN LCI. 


FT 


REPEAT 


1874 


1890 


MAP1B 


1. 


FT 


REPEAT 


1891 


1907 


MAP1B 


2. 


FT 


REPEAT 


1908 


1924 


MAP1B 


3 . 


FT 


REPEAT 


1925 


1941 


MAP1B 


4 . 


FT 


REPEAT 


1942 


1958 


MAP1B 


5 . 


FT 


REPEAT 


1959 


1975 


MAP1B 


6. 


FT 


REPEAT 


1993 


2009 


MAP1B 


7. 


FT 


REPEAT 


2010 


2026 


MAP1B 


8. 


FT 


REPEAT 


2027 


2043 


MAP1B 


9. 


FT 


REPEAT 


2044 


2060 


MAP1B 


10. 


FT 


DOMAIN 


589 


787 


LYS-RICH (HIGHLY BASIC, CONTAINS MANY 


FT 








KKEE AND KKEI/V REPEATS) . 


SQ 


SEQUENCE 


2464 


AA; 270408 


MW; FBD3DD99CFDBDA87 CRC64 ; 


Query Match 




4.9%; 


Score 


284; DB 1; Length 2464; 


Best Local j 


Similarity 20.7%; 


Pred. 


No. 0.0021; 



Matches 233; Conservative 159; Mismatches 424; Indels 312; Gaps 51; 



Qy 


31 


EPEDEEDEEEEEDEEED DEDLEELE VLERKPAAG LSAAAVP 


71 


Db 


1009 


1 1 hi Mlh II : I II 1 :: II | :| 

EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAEAGVTEEQYGYLGTSAKQ 


1068 


Qy 


72 


PAAAAPLLDFSS DSVPPAPRGPLPAAPPAAPERQP SWERSPAAPA 


116 


Db 


1069 


1 =1 ■ ■■ 1 -I hill 1 : I 

PGI QS PSREPASS I HDETLPGGSESEATASDEENREDQPEEFTATSGYTQSTI E I SSEPT 


1128 


Qy 


117 


P SLPPAAAVLPSKLPEDDEPPARP PPPPPAGASPLAEPA- - 


155 


Db 


1129 


1 • 1 h 1 I- ||: |:| : 

PMDEMSTPRDVMSDETNNEETESPSQEFVNITKYESSLYSQEYSKPAVASFNGLSEGSKT 


1188 


Qy 


156 


APPSTPAAPKRRGSGSVDETLFALPA - ASEPVI PSSAEKI MDLMEQ 

1 II : 1 h:| hi : 1 : :||| ^h : 


200 


Db 


118 9 


DATDGKDYNASASTISPP SSMEEDKFSKSALRDAYCSEEKELKASAE - -LDTKDV 


1241 


Qy 


201 


PGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEA 


260 


Db 


1242 


: 1 1 Mill 1 Ihl 

SDERLS PAKSPSLSP SPPSPIEKT 


1265 


Qy 


261 


SKELPERATNPFVNRDLAEFS ELEYSEMGSSFKGSPKGESAILVEN- -TKEEVIVR 

1 11= 1 II | = = 1 1 : || |:: |: : || : 


314 


Db 


1266 


- - PLGERSVN FSLTPNE I KVSAEGEARS VS PGVTQAWEEHCAS PEEKTLE 


1314 


Qy 


315 


SKDKEDLVCSAALHSP-QESPVGKEDRWSPEKTMDIFNEMQMSWA-PVREEYADFK-- 


370 


Db 


1315 


WSPSQSVTGSAGHTPYYQSP TDEKSSHLPTEVSENAQAVPVSFEFSEAKDE 


1366 



Qy 371 PFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLED- -SLEQKSLGKDS 417 

I I I s M= : || | : | ||: | 

Db 1367 NERA.SLSPMDE- -PVPDSESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGRRSESPF 1424 

QY 418 EGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKT 47 0 

IN M III: : : I : |: || | |l 

Db 142 5 EGKNGKQGF PDRES P VS DLT STGLYQDKQEEKSTGFIPIKEDFGPEKKTSDVETMS 14 80 

Qy 471 DEKKI EERKAQI ITEKTSPKTSNPFLVAVQDSEAD 505 

Ihh hi I I I : : | | :| 

Db 1481 SQSALALDERKLGGDVSPTQIDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGV--AEDT 1538 

QY 506 YVTTDTLSKVTEAAV- - SNMPEGLTPDLVQEACESELNEATGTKI AYETKVDLVQTSEAI 563 

I ■■■ h hi h || Ml : :|: |:: | :||| 

Db 153 9 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 15 97 

Qy 564 QES-LYPTAQLCP SFEEAEATP SPVLPDI VMEAPLNSLL- - 601 

lh : |: : II I ■ : :| | : : |:| :| 

Db 1598 QETEMS PSKEECPRPMS I S PPDFS PKTAKSRTPVQDHRSEQSSMS IEFGQESPEHSFAMD 1657 

Qy 602 PSAGASWQPSVSPLEAPPPVSYD SIKLEPENPPPYEEAMNVA 644 

h I I I h : : IN | | : I I I = : : : 

Db 1658 FSRQSPDHPTLGASVLHITEN GPTEVDYSPCDIQDSSLSHKI PPTEEPSYTQDNDLS 1714 

Qy 645 LKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKS 704 

= * ' I ' : I II I - I I : I h :| 
Db 1715 -ELISVSQVEASPSTSSAHTPS-- QI AS PLQEDTLSDWPPREMSLYASLA 1762 

Qy 705 VPEHAELVEDSSPESEPVDLFSDDSI PEVPQTQEEAVML MKESLTEVSETVAOHKE 760 

: I 1= l||: : | :| : || I I 

Db 1763 - - - -SEKVQ- -SLEGEKLSPKSDIS PLTPRESSPLYSPGFSDSTSAAKETAAAH- - 1810 

Qy 761 ERLSAS P QELGKPY LESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFN 810 

: hi! HI :: I :| :| :: : :|: : I I 

Db 1811 -QASSSPPIDAATAEPYGFRSSMLFDTMQHHLALNRDLTTSSV EKDSGGKTPGDFN 1865 

Qy 811 TAI YSNDDLLSSKEDKIKESETFSDSSPIEI IDEFPTFVSAKDDSPK LAKEYTDLE 866 

I :: | ::: : | | | | : : : | | :| 

Db 1866 YAYQKPENAAGS PDEEDYDYE - - SQEKTI RTHDWRYYYEKTERTI KSPCDSGYS YETI E 1923 

Qy 867 VSDKSEIANIQSGADSLPCLELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKAS 920 

' 1^ : I h = = I =1 I I II : : I 

Db 1924 KTTKTP EDGG YTCEITEKTTRTPEEGGYSYEI SEKTTRTPEVSGYTYEK 1972 

Qy 921 ISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRS 962 

: . = : ■ \ ■■ I = I :h I I I I 

Db 1973 TERSRRLLDDI SNGYDDTEDGGHTLGDCSYS YETTEKI TSFPESESYS 2020 



Search completed: December 19, 2003, 15:35:43 
Job time : 23 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



December 19, 2003, 15:32:53 ; Search time 46 Seconds 

( wi thout a 1 ignment s ) 
6524.243 Million cell updates/sec 

US-09-830-972-2 
5848 

1 MEDIDQSSLVSSSTDSPPRP VKDAMAKI QAKI PGLKRKA.D 1163 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 



830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL_23 : * 

1 : sp_archea : * 

2: sp_bacteria : * 

3: sp_f ungi : * 

4 : sp_human : * 

5: sp_invertebrate : * 

6 : sp_mammal : * 

7 : sp_mhc : * 

8: sp_organelle: * 

9 : sp_phage : * 

10: sp_plant:* 

1 1 : sp_rodent : * 

12: sp_virus:* 

13 : sp_vertebrate : * 

14: sp_unclassif ied: * 

15: sp_rvirus:* 

16: sp_bacteriap : * 

17 : sp_ar cheap : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

o, 
o 

Result Query 

No. Score Match Length DB ID Description 



1 


5312 . 5 


90 . 


. 8 


1162 


11 


Q8BGM9 


Q8bgm9 mus musculu 


2 


5307 


90 , 


. 7 


1163 


11 


Q8K3G8 


Q8k3g8 mus musculu 


3 


4501 . 5 


77 , 


. 0 


1046 


11 


Q8BGK7 


Q8bgk7 mus musculu 


4 


3627 . 5 


62 , 


. 0 


986 


4 


Q8IUA4 


Q8iua4 homo sapien 


5 


2926 


50 . 


. 0 


639 


11 


Q8K290 


Q8k2 9 0 mus musculu 


6 


1416 


24 , 


.2 


375 


11 


Q8BHF5 


Q8bhf5 mus musculu 


7 


1314 . 5 


22 , 


. 5 


356 


11 


Q8BH78 


Q8bh78 mus musculu 


8 


1304 


22 , 


.3 


357 


11 


Q8K3G7 


Q8k3g7 mus musculu 


9 


1283 . 5 


21 , 


. 9 


392 


4 


Q96B16 


Q96bl6 homo sapien 


10 


844 


14 . 


. 4 


179 


6 


Q9GM33 


Q9gm33 macaca fasc 


11 


792 


13 , 


. 5 


780 


11 


Q8K4S4 


Q8k4s4 mus musculu 


12 


788 


13 , 


. 5 


780 


11 


Q8K0T0 


Q8kOtO mus musculu 


13 


762 . 5 


13 , 


. 0 


760 


13 


Q90638 


Q90638 gallus gall 


14 


700 


12 , 


. 0 


643 


11 


Q8CCU2 


Q8ccu2 mus musculu 


15 


685 


11 . 


. 7 


199 


4 


Q9BQ5 9 


Q9bq5 9 homo sapien 


16 


671 


11 . 


. 5 


267 


11 


Q63765 


Q63765 rattus sp. 


17 


669 


11 . 


.4 


208 


13 


Q90637 


Q90637 gallus gall 


18 


625 . 5 


10 , 


. 7 


236 


11 


Q8VBU0 


Q8vbu0 rattus norv 


19 


625 . 5 


10 , 


. 7 


237 


11 


Q8C6D5 


Q8c6d5 mus musculu 


20 


586 


10 . 


. 0 


595 


5 


Q9VMV9 


Q9vmv9 drosophila 


21 


532 


9 , 


. 1 


224 


5 


Q9VMW1 


Q9vmwl drosophila 


22 


520 


8 . 


. 9 


202 


5 


Q9VMW2 


Q9vmw2 drosophila 


23 


520 


8 . 


. 9 


222 


5 


Q9VMW4 


Q9vmw4 drosophila 


24 


520 


8 . 


. 9 


234 


5 


Q9VMW3 


Q9vmw3 drosophila 


25 


517 


8 . 


. 8 


2484 


5 


Q9U347 


Q9u347 caenorhabdi 


26 


503 . 5 


8 , 


. 6 


2607 


5 


Q23187 


Q23187 caenorhabdi 


27 


377 


6 . 


,4 


2768 


5 


Q9VC00 


Q9vc00 drosophila 


28 


347 . 5 


5 . 


. 9 


10578 


5 


Q8ISF5 


Q8isf5 caenorhabdi 


29 


343 . 5 


5 . 


. 9 


18519 


5 


Q8ISF6 


Q8isf6 caenorhabdi 


30 


343 . 5 


5 , 


. 9 


18534 


5 


Q8ISF7 


Q8isf7 caenorhabdi 


31 


335 


5 . 


. 7 


5412 


5 


Q9W596 


Q9w596 drosophila 


32 


331 


5 . 


. 7 


4900 


5 


Q9N541 


Q9n541 caenorhabdi 


33 


328 . 5 


5 , 


. 6 


5327 


5 


076891 


076891 drosophila 


34 


322 


5 . 


. 5 


7962 


4 


Q10465 


Q10465 homo sapien 


35 


320 


5 . 


. 5 


222 


5 


Q23188 


Q23188 caenorhabdi 


36 


317.5 


5 . 


. 4 


17352 


5 


Q95YM2 


Q95ym2 procambarus 


37 


313 . 5 


5 , 


. 4 


1444 


5 


Q9VTN2 


Q9vtn2 drosophila 


38 


313 . 5 


5 , 


. 4 


1514 


5 


Q8SY55 


Q8sy55 drosophila 


39 


309.5 


■ 5 , 


. 3 


1852 


3 


Q9C2H4 


Q9c2h4 neurospora 


40 


308 


5 , 


.3 


2362 


5 


Q9VYD1 


Q9vydl drosophila 


41 


307 


5 . 


:2 


34350 


4 


Q8WZ42 


Q8wz42 homo sapien 


42 


306 


5 , 


.2 


842 


5 


Q9VGC9 


Q9vgc9 drosophila 


43 


306 


5. 


.2 


864 


5 


Q9VGC8 


Q9vgc8 drosophila 


44 


305 . 5 


5 . 


.2 


9196 


5 


Q8IQ87 


Q8iq87 drosophila 


45 


304 


5 . 


.2 


864 


5 


Q95U4 5 


Q95u45 drosophila 



ALIGNMENTS 



RESULT 1 
Q8BGM9 



ID 

AC 
DT 
DT 
DT 



Q8BGM9 

Q8BGM9; 

01-MAR-2003 

01-MAR-2003 

01-MAR-2003 



PRELIMINARY; PRT; 1162 AA. 

(TrEMBLrel . 23, Created) 

(TrEMBLrel . 23, Last sequence update) 

(TrEMBLrel. 23, Last annotation update) 



DE RTN4 . 

GN RTN4 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7 / and 129SvcJ7; 

RA Oertle T., van der Putten H. , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2 002) to the EMBL / GenBank / DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T\, Schwab M.E.; 

RL Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H. , Mir A.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102284; AAM73506.1; 

DR EMBL; AY102286; AAM73511.1; 

SQ SEQUENCE 1162 AA; 126613 MW; 855697FBEE11781F CRC64 ; 



Query Match 90.8%; Score 5312.5; DB 11; Length 1162; 

Best Local Similarity 91.0%; Pred. No. 3.1e-262; 

Matches 1064; Conservative 37; Mismatches 55; Indels 13; Gaps 6; 



Qy 


i 


MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 


60 


Db 


l 


-Mil 1 1 1 : 1 1 1 : : 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 


59 


Qy 


61 


PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 

MINIM III IIIIIIIIIIMIIIIIIMIIIMI Ml MINIMI Mill 

PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 


120 


Db 


60 


118 


Qy 


121 


PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 


180 


Db 


119 


MINI Ml llllllllll II llllll II MMMIIMI Mill llllll MM 

PAAAVIjPSKIjPEDDEPPAR- -PPAPAGASP1AEPAAPPSTPAAPKRRGSGSVDETLFALP 


176 


Qy 


181 


AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 


240 


Db 


177 


lllllll llllllllll II Mill II II II II III lllllll III II III III Mill 

AASEPVI PSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 


236 


Qy 


241 


GNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDIAEFSELEYSEMGSSFKGS PKGES 


300 


Db 


237 


1 II 1 1 h h M 1 1 1 M II 1 1 h II II 1 1 1 1 M II N MM 1 1 1 1 i 1 1 1 1 1 1 MM 

GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 


296 


Qy 


301 


AI LVENTKEEVI VRS KDKEDLVCSAALHS PQES P VGKEDRWSPEKTMDI FNEMQ 355 



I III hlllllMllllh 

WKEDGVMSPEKTMDIFNEMK 356 



Db 


297 


AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMK 3 56 


Qy 


356 


MSVVAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 415 

IMIIIIIIIIIIIMIIIIMIIIIIIIMIIIIIIIhllllhll Illllll II 


Db 


357 


MSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGK 416 


Qy 


416 


DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 


475 


Db. 


417 


Ml llhllll III III Illllll Ihllllll II Ihlllllllllllllll 

DSESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKI 


476 


Qy 


476 


EERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 535 

II MIIIMI IMMIIII- MINI II Illllll h IIIIMIIIMM 


Db 


477 


EERKAQI ITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEA 536 


Qy 


536 


CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 


595 


Db 


537 


II IIIIMIIMMIlllllllllillllMIIMIIIIII IMIMillllillill 

CESELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCPSFEEAEATPSPVLPDIVMEA 


596 


Qy 


596 


PLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIK 


655 


Db 


597 


MIIIMI MM III MM 1 Mill MIMMIMM llhllll Ml II 

PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIK 


656 


Qy 


656 


EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 


715 


Db 


657 


IIIIIMI II 1 1 1 1 1 M l 1 1 1 1 1 1 1 |. |||:|| 

EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDS 


716 


Qy 


716 


SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 


774 


Db 


717 


MIMIMMIMMIMMMMIMMMIMMMM Ml Mlllllllhllll 

SPESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 


776 


Qy 


. 775 


LESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFS 


834 


Db 


777 


II III! 1 hi 1 hi ,1 1 1 1 1 hi 

LESFQPNLHITKDAASNEI PTLTKKETISLQMEEFNTAI YSNDDLLSSKEDKMKESETFS 


836 


Qy 


835 


DSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 

IIIIIIMIIIIIIIIIIIIII IIMIIIIhlllllhlllhllll IIIIMII 


894 


Db 


837 


DSSPIEIIDEFPTFVSAKDDSP KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 


893 


Qy 


895 


KNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 


954 


Db 


894 


1 1 MIM MIIM|::|||||| : llllll 1 llhlll 1 IIMIhllll 
KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPS 


953 


Qy 


955 


DTEKEDRSLSAVLSAELS KTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI 


1014 


Db 


954 


M 1 1 M 1 1 h II II II h 1 1 M II II 1 II M II 1 1 II 1 II 1 1 1 1 II 1 1 II 1 II 1 II II II 

DTEKEDRSLTAVLSAELNKTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI 


1013 


Qy 


1015 


ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 

IIIIMIIMIIIIMIIIIIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIII 


1074 


Db 


1014 


ALALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI 


1073 


Qy 


1075 


KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSI PVI YERHQVQID 1134 


Db 


1074 


M M 1 II II II 1 1 II II II II II II II M II 1 II II II M II 1 M II 1 II 1 M II 1 III 

KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQAQI D 1133 


Qy 


1135 


HYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 





MIIIIIIIIMMIIIMIIIIIIIIh 



Db 1134 HYLGLANKS VKDAMAKI QAKI PGLKRKAE 1162 



RESULT 2 
Q8K3G8 

ID Q8K3G8 PRELIMINARY; PRT; 1163 AA. 

AC Q8K3G8; 

DT Ol-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel . 22, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Nogo-A. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID= 10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=BALB/ C ; 

RA Jin W., Long M. , Li R. , Ju G . ; 

RT "Cloning and expression of the mouse Nogo-A protein."; 

RL Submitted (MAY-2002) to the EMBL/GenBank / DDB J databases. 

DR EMBL; AY114152; AAM77068 . 1 ; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1163 AA; 126691 MW; 6B5F3 62 7994 17EA4 CRC64 ; 

Query Match 90.7%; Score 5307; DB 11; Length 1163; 

Best Local Similarity 91.1%; Pred. No. 6e-262; 

Matches 1066; Conservative 34; Mismatches 56; Indels 14; Gaps 7; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Illllllllllll llllllllllllllllllllllllhll IMIIMIIIIIIIIII 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

lllllll IMIIIIIIIIIIIIIIMIIIMMIII llllllllllllll Mill 

Db 60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

QY 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

M M I I I II i I !' I I I Ml hi , I I 

Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPS-TPAAPKRRGSGSVDETLFALP 176 

Qy 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 

IIIIIIIIMIIIIIII MMMMMIMIMM 

Db 177 AASEPVI PSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

1 1 1 1 1 1 = f ^ F I i 1 1 1 f 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 ^ MM llllllllll lllllll 

Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 2 96 
Qy 3 01 AI LVENTKEEVI VRS KDKEDLVCSAALHS PQES P VGKEDRWSPEKTMDI FNEMQ 355 

. I Ml ■ . i ■ - 

Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMK 356 



Qy 


356 


MSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGK 

1 1 1 1 1 I'l 1 1 M hi 1 1 1 1 M 1 M MINIM II 

MSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLiAARANMESKVDKKCFEDSLEQKSHGK 


415 


Db 


357 


416 


Qy 


416 


DSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKI 


475 


Db 


417 


III MMIIIMM III IMMM 11111:11 II 1 MINIMUM II II 

DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTAANIFPVLEDHTSENKTDEKKI 


476 


Qy 


476 


EERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEA 


535 


Db 


477 


1 1 IN II 1 ; Ml II II 1 II 1 II II M 1 1 1 1 1 1 1 1 1 1 1 1 1 

EERKAQI I TEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEA 


536 


Qy 


536 


CESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEA 


595 


Db 


537 


llllll INNIM llllll IMIIIMNMII 1 II III Illlllll III lllllll 1 

CESELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCPSFEEAEATPSPVLPDIVMEA 


596 


Qy 


596 


PLNSLLP SAGAS WQPSVS PLEAPPPVS YDS I KLEPENPPPYEEAMNVALKALGTKEGI K 


655 


Db 


597 


MINIM MM Ml MM 1 MMI II! ■ IIMIIIIMM 1 1 1 1 

PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 


656 


Qy 


656 


EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 


715 


Db 


657 


MMIMI M MMMMMMMMMMMI MMIMMMMMM M M M 

EPESFKfAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 


716 


Qy 


716 


SPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPY 


774 


Db 


717 


M 1 1 1 M M M 1 II IN Mill 

SPESEPVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 


776 


Qy 


775 


LESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFS 


834 


Db 


111 


Ml 1 1 1 1 : 1 II || || I | | | | || || | | || || II 1 M 1 1 1 1 II 1 

LESFQPNLHI TKI3AASNEIPTLTKKETI SLQMEEFNTAI YSNDDLLSSKEDKM KESETFS 


836 


Qy 


835 


DSSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSF 


894 


Db 


837 


NIINNNNINNNNI IMIIMM II MMI MMIMI Illlllll 

DSSPIEI I DEFPTFVSAKDDSP KEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSF 


893 


Qy 


895 


KNI YPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPS 

II INN MINIMUM : llllll 1 MMI 1 INNMINI 


954 


Db 


894 


KNTYPKDEAHVSDEFSKSRSSVSKVPLLLPMVSALESQIEMGNIVKPKVLTKEAEEKLPS 


953 


Qy 


955 


DTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGW-FGASLFLLLSLTVFSI VSVTAY 


1013 


Db 


954 


1 MM II II M ! M M II II II II II 1 M m MMMMMMMMMMM 

DTE KE DRSLTAVLSAELNKTS WDLLYWRD I KKTGVVYFGASLFLLLSLTVF SI VSVTAY 


1013 


Qy 


1014 


IALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 

1 1 1 1 ( 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 i 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 I 1 1 1 1 1 ! 1 1 1 1 

IALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNST 


1073 


Db 


1014 


1073 


Qy 


1074 


IKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQI 


1133 


Db 


1074 


llllll MINI jl MMIIIMM Mill III 1 EMM MMIMI III MMI II 

I KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQAQ I 


1133 


Qy 


1134 


DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 




Db 


1134 


MMIMIII ■MMIMMMMIMM 

DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 1163 





RESULT 3 
Q8BGK7 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 



PRELIMINARY; 



PRT; 1046 AA. 
Created) 

Last sequence update) 
Last annotation update) 



Craniata ; Vertebrata ; Euteleostomi ; 
Sciurognathi ; Muridae ; Murinae ; Mus . 



Q8BGK7 
Q8BGK7; 

01-MAR-2003 (TrEMBLrel . 23, 
01-MAR-2003 (TrEMBLrel- 23, 
01-MAR-2003 (TrEMBLrel. 23, 
RTN4 . 
RTN4 . 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
NCBI JTaxID=10090 ; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
RA Oertle T. , van der Putten H . , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A. 
RC STRAIN=129/SvcJ7; 
RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H. , Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
DR EMBL; AY102280; AAM73502 . 1 ; -. 
DR EMBL; AY102286; AAM73507.1; 

SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64; 



Query Match 77.0%; Score 4501.5; DB 11; Length 1046; 

Best Local Similarity 86.9%; Pred. No. 5.5e-221; 

Matches 910; Conservative 40; Mismatches 68; Indels 29; Gaps 



4; 



Qy 

Db 



143 PPPAGASPLAEPAAPPSTPA — — -— - APKRRGSGSVDETLFALPAA 182 

INI h h III: : I I I I I I I I I 

3 PPLAGGGQKGGAASEAWVPSLFVGVSGSTCTAAKSLVPI PARSSRLSAARNETLFALPAA 62 



Qy 

Db 

Qy 
Db 

Qy 



183 SEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGN 242 

MMMMMIMM MMMMMIMM M M Ml 1 1 1 II 1 1 1 1 1 1 Ml II 

63 SEPVI PSSAEKI MDLKEQPGNTVSSGQEDFPS VLFETAASLPSLS PLSTVSFKEHGYLGN 122 
243 LSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAI 3 02 

IMhhIMIIIIIIMhIMIIIMIMIh MM MIIIIMM IMIIIIh 

123 LSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGESAM 182 
303 LVENTKEEVI VRS KDKEDLVCSAALHSPQES P VGKEDRWS PEKTMD I FNEMQMS 357 

M 1 1 M 1 1 1 1 1 II I M 1 1 1 1 1 M II h I II M I Ml h 1 1 1 1 1 1 II I II h 1 1 



Db 183 LVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMKMS 242 



Ov 

vy 


358 


WAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDS 


417 


Db 


243 


IIIIIIIIIIIMIIIMIIIIIIIIIIIIIIMIIIMIIIIMI Illllll MM 

WAPVREEYADFKPFEQAWEVKI)TYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGKDS 


302 


Ov 


418 


EGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEE 

1 111 = 1111 III III Illllll IN HI II II IMI 1 1 1 1 1 1 II II Ml II 


477 


Db 


303 


ESRNENAS F PRTPELVKDGSRAY I TCDSFSSATESTAANI F P VLEDHTS ENKTDEKKI EE 


362 


Ov 

VY 


478 


RKAQ 1 1 TEKTS P KTSN P FLVAVQDS EADYVTTDTLS KVTEAAVSNM PEGLT PDLVQEACE 


537 


Db 


363 


MIMMMMMMMMIM MIMIIIM Illllll 1 MIMIIIIIIIIII 

RKAQI ITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKAn^EAWATMPEGLTPDLVQEACE 


422 


Ov 


538 


SELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPL 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 II 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II 1 II II 1 1 1 1 1 


597 


Db 


423 


SELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCPSFEEAEATPSPVLPDIVMEAPL 


482 


Ov 
vy 


598 


NSLLPSAGAS WQPS VS PLEAP PP VS YDS I KLEPENP P P YEEAMNVALKALGTKEGI KE P 

MUM MM III MM 1 Mill II II II 1 1 1 II 1 M h 1 1 M Ml MM 


657 


Db 


483 


NSLLPSTGASVAQPSASPLEVPSPVSYDGI KLEPENP PPYEEAMSVALKTSDSKEE I KEP 


542 


Ov 
vy 


658 


ESFNAAVQETEAPYI S I ACDLI KETKLSTEPS PDFSNYSE I AKFEKSVPEHAELVEDSS P 


717 


Db 


543 


MUM II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 Mhllll 

ESFNAAAQEAEAPYI S IACDLI KETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSP 


602 


Ov 


718 


ESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EERLSASPQELGKPYLE 

IMIMIIIIIIIIIIIIIIIIMIMIIIMIIIMI III Mlllllllhllllll 


776 


Db 


603 


ESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPYLE 


662 


Ov 
vy 


777 


SFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDS 


836 


Db 


663 


Illllll IMMMMMIMM IIIMIIIIIMIII IIMII Ihhlllll MM 

SFQPNLHITKDAASNEI PTLTKKETI SLQMEEFNTAI YSNDDLLSSKEDKMKESETFSDS 


722 


Ov 
vy 


837 


S P I E 1 1 DEFPTFVSAKDDS PKLAKE YTDLEVSDKS E I ANI QSGADSLPCLELPCDLSFKN 

1 II 1 1 1 II II 1 1 II II II 1 1 II 1 1 II II 1 M II 1 1 1 M II 1 M II 1 II 1 II II II 1 


896 


Db 


723 


S P I E 1 1 DEF PTFVSAKDDS P KEYTDLE VSNKSE I ANVQSGANSLPCS ELPCDLS FKN 


779 


Ov 
v y 


897 


IYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDT 


956 


Db 


780 


Mill 1 1 " 1 : IIMII 1 Ml 1 MMMMIMM 

TYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDT 


839 


Ov 
vy 


957 


EKEDRSLSAVLSAELSKTSWDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIAL 


1016 


Db 


840 


NIMH 1 Ml 1 1 1 Ml 1 1 II 1 III III MMMMMMIIMM MIIMI MM II 

EKEDRSLTAVTjSAELNKTSVVDLLYWRD I KKTGWFGASLFLLLSLTVFS I VSVTAYIAL 


899 


Ov 
vy 


1017 


ALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KE 

1 1 1 1 1 1 1 M 1 1 1 1 1 il',1 1 Ihlil , M 1 1 II IMII 1 M 1 1 1 1 1 1 1 1 1 IT 


1076 


Db 


900 


ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 


959 


Qy 


1077 


LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHY 


1136 


Db 


960 


; 1 1 1 iii 1 1 1 iii M 1 1 1 1 ii 1 1 1 ii i i ii 1 1 1 M 1 1 1 r ■ II 1 1 1 1 M Mill 

LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL I SLFS I PVI YERHQAQI DHY 


1019 


Qy 


1137 


LGLANKSVKDAMAKIQAKI PGLKRKAD 1163 




Db 


1020 


lllllMIIMIIIIMIIIMMIh 

LGLANKSVKDAMAKIQAKI PGLKRKAE 1046 





RESULT 
Q8IUA4 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 



OX 

RN 

RP 

RA 

RT 

RT 

RL 

RN 

RP 

RA 

RL 

RN 

RP 

RA 

RL 

RN 

RP 

RC 

RX 

RA 

RT 

RT 

RL 

DR 

DR 

DR 

DR 

DR 

DR 

DR - 

SQ 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



Q8IUA4 PRELIMINARY; PRT; 986 AA . 

Q8IUA4; 

01-MAR-2003 (TrEMBLrel . 23, 
01-MAR-2003 (TrEMBLrel. 23, 
01-MAR-2003 (TrEMBLrel. 23, 
RNT4 (RTN4) . 
RTN4 . 

Homo sapiens (Human) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
NCBI_TaxID=9606; 
[1] 

SEQUENCE FROM N . A . 

Oertle T. , van der Putten H. , Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 
Structures of Human and Mouse Nogo/Rtn-4 . " ; 
Submitted (OCT-2002) to the EMBL / GenBank / DDB J databases. 
[2] 

SEQUENCE FROM N.A. 
Oertle T. , Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/ GenBank / DDB J databases. 
[3] 

SEQUENCE FROM N.A. 
Van der Putten H. ; 

Submitted (MAY-2 002) to the EMBL/ GenBank/DDB J databases. 
t4] 

SEQUENCE FROM N.A. 
TISSUE-Testis; 

MEDLINE=2237654 0; PubMed=1248 8 0 97 ; 

Oertle T., van der Putten H. , Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 
Structures of Human and Mouse Nogo/Rtn-4 . " ; 
J. Mol. Biol. 325:299-323(2003)'. 



EMBL; AY102285; 

EMBL; AY123245; 

EMBL; AY123246; 

EMBL; AY123247; 

EMBL; AY12324 8; 

EMBL; AY123249; 

EMBL; AY 123250; 
SEQUENCE 986 



AAM64244 . 1 
AAM64249 . 1 
AAM64250. 1 
AAM64251 . 1 
AAM64252 . 1 
AAM64253 . 1 
AAM64254 . 1 
AA; 108449 MW; 



0CDE8F647036415A CRC64 ; 



Query Match 62 . 03 

Best Local Similarity 75.43 
Matches 745; Conservative 



Score 3627.5; DB 4; Length 986; 
Pred. No. 1.6e-176; 
96; Mismatches 126; Indels 21; Gaps 



13; 



Qy 

Db 

Qy 
Db 



195 MDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTI E 254 

Ml MiMhHIlM IIMMUMI Mill Mill MM I MM- 

1 MDLKEQPGNTI SAGQEDFPS VLLETAASLPSLS PLSAASFKEHEYLGNLSTVLPTEGTLQ 60 

255 ETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGESAILVENTKEEVIVR 314 
I -llllh hi -IN I I I I I I I I i I I I I I Ml llhH I Hhlh 

61 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 120 



Qy 


315 


SKDKED - LVCSAALHS PQES PVG KEDRWSPEKTMDI FNEMQMSWAPVREEYAD 

: | | : [ : | | 1 1 : 1 1 1 MINIM 1 1 1 1 : : : 1 1 1 : 1 1 1 1 1 1 


368 


Db 


121 


ii i ii ii ii i iii iii it t ill i ii i i i i i i 

NKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 


180 


Qy 


369 


FKPFEQAWEVKDTYEGSRDVLAA RANVESKVDRKCLEDSLEQKSLGKDSEGRNEDA 


424 


Db 


181 


llllh 1:1 1 1 hill MMIMhll Mill : MM hi 

FKPFERVWEVKDSKEDS -DMLAAGGKI ESNLESKVDKKCFADSLEQTNHEKDSESSNDDT 


239 


Qy 


425 


SFPSTPEPVKDSSRAYITCASFT-SATESTTANTFPLLEDHTSENKTDEKKIEERKAQII 


483 


Db 


240 


II II I M : 1 1 1 1 1 1 1 1 1 1 Mill | MM | 1 1 II II ll M M 1 M 1 1 M 

SFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIV 


299 


Qy 


484 


TEK-TSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESELNE 

Ml M M M M M 1 MM M M M MM;; 1 M 1 1 M 1 1 1 1 1 1 1 1 1 II II 1 M 

TEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNE 


542 


Db 


300 


359 


Qy 


543 


ATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI VMEAPLNSLLP 


602 


Db 


360 


M M M M M M M M M M M M 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 ! M 

VTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVP 


419 


Qy 


603 


SAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKAL-GTKEGIKEPESFN 


661 


Db 


420 


M M M M M M M 1 1 M M 1 1 1 1 1 II 1 1 1 1 II 1 M M 1 1 II 1 M 1 M 1 

SAGASVIQPSSSPLEA- SSVNYESI KHEPENPPPYEEAMSVSLKKVSGI KEEI KEPENIN 


478 


Qy 


662 


AAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEP 


721 


Db 


479 


M M M M M M M M M M M M 1 1 M 1 M M II M 1 M 1 1 M M 1 1 M M 1 M 1 1 

AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 


538 


Qy 


722 


VDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVAQHK-EERLSASPQELGKPYLESFQ 

IIMIIMMMM hi MIMIMM 1 1- ::: MM II 1 1 IMIIIIM 


779 


Db 


539 


VDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFK 


598 


Qy 


780 


PNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSP 


838 


Db 


599 


M MM : : : M M 1 1 M Mill MIMIMM III MMMIIMMI 

LSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSP 


658 


Qy 


839 


IEI IDEFPTFVSAKDDS-PKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI 


897 


Db 


659 


M || | M 1 1 MM M 1 1 1 M II 1 1 II 1 1 II 1 1 1 II MM II 1 III III 

IEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNI 


718 


Qy 


898 


YPK--DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSD 
M MMIMhll | : ! | | | | | : || M 1 1 II II II II II 


955 


Db 


719 


Q P KVE EKISFSDDFS KNG S ATS KVLLL P P D VS ALATQAE I E S I VKP KVL VKEA E KKL PSD 


778 


Qy 


956 


TEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 


1015 


Db 


779 


IMIII! Ih IIIIIMIIIMIMMIIIMIIIIIMMMIIIIIIIIIIMIII 

TEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 


838 


Qy 


1016 


LALLS VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI K 


1075 


Db 


839 


MM M M M MMM IM M MM M MM M M M MM M M MM M MM III 

LALLSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNCTIK 


898 


Qy 


1076 


ELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL I SLFS I P VI YERHQVQ I DH 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II II II 1 1 M 1 1 1 M 1 1 1 1 1 1 1 II 1 1 M 1 M II 1 1 MM 

ELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL I SLFSVPVI YERHQAQI DH 


1135 


Db 


899 


958 



Qy 1136 YLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

III IhlllMllMlM -IMIh 

Db 959 YLG LAN KN VKDAMAKI QAKI PGLKRKAE 986 

RESULT 5 
Q8K290 

ID Q8K290 PRELIMINARY; PRT; 639 AA. 

AC Q8K290; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata,- Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL ; BC032192; AAH32192.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 3 09A19DA37603F11 CRC64 ; 

Query Match 50.0%; Score 2926; DB 11; Length 639; 

Best Local Similarity 91.6%; Pred. No. 4.7e-141; 

Matches 588; Conservative 20; Mismatches 30; Indels 4; Gaps 2; 
Qy 523 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEA 582 

III IIIIMIIMIIII lllllllllllll ■ lllllhllllii: llllll 

Db 1 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCPSFEEAEA 60 

Qy 583 TPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMN 642 

MM II MM Mill Mill MM Ml MM I MMI I II 1 1 1 1 1 1 1 1 1 1 M 

Db 61 TPSPVLPDI VMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMS 120 

Qy 643 VALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFE 702 

llll II llllllllll II lllllllllllllllllillllhlllllllllll 

Db 121 VALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFE 180 

Qy 703 KSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK-EE 761 

MMM 1 1 1 M II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 Ml ■ I 

Db 181 KSVPDHCELVDDSSPESEPVTJLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKE 24 0 

Qy 762 RLSAS PQELGKPYLESFQPNLHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDDLLS 821 

mi MHnmiiu iiiiiihiiMiiii iiiMiiiii 1 iiiiM 

Db 241 RLSASPQEVGKPYLESFQPNLHITKDAASNEI PTLTKKETI SLQMEEFNTAI YSNDDLLS 300 

Qy 822 S KEDKI KES ETFSDS S P I E 1 1 DE F PTF VSAKDDS P KLAKE YTDLE VS DKS E I AN I QSGAD 881 

1 1 1 . 1 1 1 1 1 1! 1 1 1 1 1 1 1 Ml 1 1 1 ii: 1 1 1 1 1 1 1 h 1 1 ; h 1 1 1 h 

Db 301 SKEDKMKESETFSDSSPIEI IDEFPTFVSAKDDSP- - -KEYTDLEVSNKSEIANVQSGAN 357 



Qy 882 SLPCLELPCDLSFKNIYPKDEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKS 941 

1 1 II III MM Mill IMIIMMI II : Mill I MMM 

Db 358 SLPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKP 417 

Qy 942 KSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLS 1001 

I IIIMhIIIIIIIIMMhMllllhMIIIIIIIIIMIIIIIIIMIIIIIII 

Db 418 KVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGWFGASLFLLLS 477 

Qy 1002 LTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQK 1061 

MMMM MMIMIMI IMIMIMI MIIIIMM IMIMMIM I 

Db 478 LTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQK 537 

Qy 1062 YSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS 1121 

MMMM IMIMIMM IMIIMMI MMMIIMI MIMMIMM I 

Db 538 YSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFS 597 

Qy 1122 I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

Mill MM III MMMM I MIIIIMM II MMMM 

Db 598 I PVI YERHQAQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 639 

RESULT 6 
Q8BHF5 

ID Q8BHF5 PRELIMINARY; PRT; 375 AA. 

AC Q8BHF5; 

DT 01 -MAR-2 0 03 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAI N= 12 9/ Svc J7 , and 129SvcJ7; 

RA Oertle T. , van der Putten H . , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 12 9SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDB J databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC. STRAIN=129SvcJ7; 

RA Van der Putten H . , Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 



DR EMBL; AY102282; AAM73504 . 1 ; -. 
DR EMBL; AY102286; AAM73509.1; 

SQ SEQUENCE 375 AA; 40300 MW; 23D9EB1 9BE67 1AE6 CRC64; 

Query Match 24.2%; Score 1416; DB 11; Length 375; 

Best Local Similarity 31.6%; Pred. No. 1.8e-64; 

Matches 367; Conservative 2; Mismatches 6; Indels 788; Gaps 4; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 6 0 

lllllllllllll I Mill: 1 1 1 1 1 1 1 i 1 1 ; 1 ! 1 1 : 1 : I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 5 9 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

I I I I I I I I III I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 18 0 

IIIIIIIIIIIIIIIIMI II Mill Mill lllllill IIIIMMIIIIIIIII 

Db 119 PAAAVLPSKLPEDDEPPAR- - PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 24 0 



Db 177 AASEPVI PSSA 187 

Qy 241 GNLSAVSSSEGTI EETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Db 188 187 

Qy 3 01 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Db 188 187 

Qy 361 PWEEYADFKPFEQAWEVKDTYEGSRDVT.AARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

Db 188 187 

Qy 421 NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

Db 188 187 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 54 0 

Db 188 187 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Db 188 187 

Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Db 188 187 

Qy 661 NAAVQETEAPYI S IACDLI KETKLSTEPSPDFSNYSEI AKFEKSVPEHAELVEDSSPESE 720 

Db 188 187 

Qy 721 PVDLFSDDSI PEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 78 0 



Db 



188 



187 



Qy 781 NLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKI KESETFSDSSPIE 840 

Db 188 187 

Qy 841 I IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNI YPK 900 

Db 188 187 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 188 187 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLS 102 0 

I: 1 1 1 1! 1 1 1 1 1 Mi I IN 1 1 i 1 1 II ! 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 188 WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYIALALLS 232 

Qy 1021 VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 1080 

1 1 1 M! I II M 1 1 1 1 ll I M II I M 1 1 II M I M 1 1 : 1 1 II 1 1 M 1 1 1! 1 1 1 1 1 i 

Db 233 VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 292 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 1140 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIII! 

Db 2 93 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQAQI DHYLGLA 352 

Qy 1141 NKS VKDAMAKI QAKI PGLKRKAD 1163 

I ! I I I I I II ,1 I I I I M I I I I 

Db 353 NKS VKDAMAKI QAKI PGLKRKAE 375 

RESULT 7 
Q8BH78 

ID Q8BH78 PRELIMINARY; PRT; 356 AA. 

AC Q8BH78; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7, and 129SvcJ7; 

RA Oertle T. , van der Putten H. , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 



RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Put ten H . ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN-12 9SVCJ7; 

RA Van der Putten H. , Mir A.; 

RL Submitted (MAY-2002) to the EMBL / GenBank / DDB J databases. 

DR EMBL; AY102281; AAM73503.1; -. 

DR EMBL; AY102286; AAM73508.1; -. 

SQ SEQUENCE 356 AA; 38403 MW; 43 66C03BA963 0B56 CRC64 ; 



Query Match 22.5%; Score 1314.5; DB 11; Length 356; 

Best Local Similarity 29.9%; Pred. No. 2.5e-59; 

Matches 348; Conservative 2; Mismatches 6; Indels 807; Gaps 4 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

iiiiiiiiiiiii iiiiiiui.ru i 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

I M 1 1 II I II! MMIIIIIIIIMIMIMIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVIjPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 



Db 119 PAAAVLPS KLPEDDEP PAR - - P PAPAGAS PLAEPAAP PSTPAAPKRRGSGS V 168 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 169 _ 168 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 3 00 

Db 169 168 

QY 301 A I LVENTKEEVI VRS KDKEDLVCSAALHS PQES PVGKEDRWS PEKTMD I FNEMQMS WA 360 

Db 169 1 68 

Qy 3 61 PWEEYADFKPFEQAWEVKDTYEGSRDV1AARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

Db 169 168 

QY 421 NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

Db 169 168 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540 

Db 169 168 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 



Qy 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660. 

Db 169 168 

Qy 661 NAAVQETEAPYI S I ACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Db 169 168 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 169 168 

Qy 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAI YSNDDLLSSKEDKIKESETFSDSSPIE 840 

Db 169 168 

Qy 841 1 1 DEFPTFVSAKDDS PKLAKEYTDLEVSDKSE I ANI QSGADSLPCLELPCDLSFKNI YPK 900 

Db 169 168 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 169 168 

Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI VSVTAYIALALLS 1020 

M 1 1 1 II M 1 1 1 1 1! M 1 1 II III 1 1 1 1 I 1 1 1 M Ml I 

Db 169 WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VS VTAY I ALALLS 213 

Qy 1021 VTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080 

IIIIIIIMIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIMIIIIIIII 

Db 214 VTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRL 273 

Qy 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLA 114 0 

MINIM MM MINIM III III MINIM llll.llllllllllll 

Db 274 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL I LALI SLFS I PVI YERHQAQ I DHYLGLA 333 

Qy 1141 NKSVKDAMAKIQAKI PGLKRKAD 1163 

Db 334 NKSVKDAMAKIQAKI PGLKRKAE 356 

RESULT 8 
Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT; 357 AA . 

AC Q8K3G7; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 {TrEMBLrel. 23, Last annotation update) 

DE Nogo-B. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCB I_TaxI D= 1 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N . A . 



RC STRAIN=BALB/ c ; 

RA Jin W., Li R. , Long M. , Shen J. , Ju G. ; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY114153; AAM77069.1; -. 

DR MGD; MGI : 1915835 ; Rtn4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64 ; 



Query Match 22.3%; Score 13 04; DB 11; Length 357; 

Best Local Similarity 29.9%; Pred. No. 8.4e-59; 

Matches 348; Conservative 2; Mismatches 6; Indels 808; Gaps 5; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

iimiiiiiiii 11111111111111111111111ih.il ii in ii 1 1 1 ii 1 1 1 ii 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

MINIM III I M Ml II MM I M M Ml II II MIIIIIIIIIIM Mill 

Db 6 0 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 



Db 119 PAAAVLPSKLPEDDEPPAR- -PPAPAGASPLAEPAAPPSTPAAPKRRGSGSV 168 

Qy 181 AASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Db 169 168 

Qy 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Db 169 168 

Qy 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360 

Db 169 168 

Qy 3 61 PWEEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

Db 169 168 

Qy 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480 

Db 169 168 

Qy 481 Q 1 1 TEKTS P KTSN P FL VAVQDS EAD YVTTDTLS KVTEAAVSNM PEGLT PDLVQEACES EL 54 0 

Db 169 168 

Qy 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI VMEAPLNSL 600 

Db 169 168 

Qy 601 L P S AGA S WQ P S VS P LEA P P P VS YDS I KLE P EN P P P Y E EAMN VAL KALGT KEG I KE P E S F 660 



Db 169 168 

Qy 661 NAAVQETEAPYI SIACDLI KETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 72 0 

Db 169 168 

Qy 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Db 169 168 

Qy 781 NLHSTKDAASND I PTLTKKEKI SLQMEEFNTAI YSNDDLLSS KEDKI KES ETFSDSS P I E 840 

Db 169 168 

Qy 841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Db 169 168 

Qy 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960 

Db 169 168 



Qy 961 RSLSAVLSAELSKTSWDLLYWRDIKKTGW-FGASLFLLLSLTVFSIVSVTAYIAliALL 1019 

Mill III Mill III 1 1 ! 1 1 1 1 II I Ml 1 1 M II I II I II I 

Db 169 WDLLYWRDIKKTGWYFGASLFLLLSLTVFSIVSVTAYIALALL 213 

Qy 1020 SVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRR 1079 

I I I I I I I I I I I I 1 I I II I I I 1 I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I 
Db 214 SVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELRR 273 

Qy 1080 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGL 1139 

MMMMIMIMMMMMMMIMMMIMMIMIMMIMM Illlllll 

Db 274 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQAQ I DHYLGL 333 

Qy 114 0 ANKS VKDAMAKI QAKI PGLKRKAD 1163 

1 1 1 1 II I M II I II I II I M II M 

Db 334 ANKS VKDAMAKI QAKI PGLKRKAE 357 



RESULT 9 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA . 

AC Q96B16; 

DT 0-1 -DEC- 2 001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein (RTN4) . 

GN RTN4. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RA Strausberg R.; 

RL Submitted (OCT-2001) to the EMBL/ GenBank / DDB J databases. 



RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T., van der Putten H . , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Oertle T. , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2237654 0; PubMed=12488 097 ; 

RA Oertle T. , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL; BC016165; AAH16165.1; -. 

DR EMBL; AY102285; AAM64242.1; -. 

DR EMBL; AY102278; AAM64247.1; 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453 ; Reticulon; 1 . 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64 ; 

Query Match 21.9%; Score 1283.5; DB 4; Length 392; 

Best Local Similarity 29.4%; Pred. No. l.le-57; 

Matches 346; Conservative 11; Mismatches 20; Indels 801; Gaps 7; 
Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

iihiii inn him iiiiiiii mill 1 1 in i- 1 1 1 1 1 1 ii mi i 

Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 
Qy 61 PAAGLSAAAVP- - PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

iiiiiiii ii iii 1 1 hi i =1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 mi iiiii- im i 

Db 5 9 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 
Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III I III I II I III II I II III II I II II II IIIIIIIIMIII ; 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSVDETLFALPAASEPVI PSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPS 22 5 

II lllllllllllllllllll III 
Db 17 9 RGSSGSVDETLFALPAASEPVIRSSA 2 04 

Qy 226 LSPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEY 285 

Db 205 204 

Qy 286 SEMGSSFKGSPKGESAILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPE 345 



Db 



205 



204 



Qy 346 KTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLE 4 05 

Db 205 204 

Qy 406 DSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHT 465 

Db 205 204 

Qy 466 SENKTDEKKI EERKAQI ITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPE 525 

Db 205 204 

Qy 526 GLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPS 585 

Db 205 ; 204 

Qy 586 PVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVAL 645 

Db 205 204 

Qy 646 KALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSV 705 

Db 2 05 204 

Qy 706 PEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSA 765 

Db 205 204 

Qy 766 SPQELGKPYLESFQPNLHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSSKED 825 

Db 205 204 

Qy 826 KIKESETFSDSSPIEIIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPC 885 

Db 205 204 

Qy 886 LELPCDLSFKNI YPKDEVHVSDEFSENRSSVSKAS I SPSNVSALEPQTEMGS I VKSKSLT 945 
Db 205 204 

Qy 946 KEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1005 

III .lllllllllllllMIII ! 
Db 2 05 WDLLYWRDI KKTGWFGASLFLLLSLTVF 234 

Qy 1006 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1065 

- 1 1 1 II I M : 1 1 1 1 1 II 1 1 ! I MM I III 1 1 1 1! I M II 1 1 1 1 1 1 II 1 1 . II 

Db 235 SIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 294 

Qy 1066 ALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI SLFSI PVI 1125 

Ml! Illlll IIIIIIIIIIIIUIIII Mlllllllllllllllll ^ II 
Db 2 95 ALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSVPVI 3 54 

Qy 1126 YERHQVQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAD 1163 

Mill IIIIIMIIIhllllllllllllllllllh 

Db 355 YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 3 92 



RESULT 10 
Q9GM33 



ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2001 (TrEMBLrel . 16, Created) 

DT 01-MAR-2001 (TrEMBLrel . 16, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Hypothetical 19.9 kDa protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN fl] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Osada N. , Hida M., Kusuda J., Tanuma R. , Iseki K. , Hirai M. , Terao K. , 

RA Suzuki Y., Sugano S., Hashimoto K. ; 

RT "Isolation of full-length cDNA clones from macaque brain cDNA 

RT libraries."; 

RL Submitted (OCT-2000) to the EMBL/ GenBank / DDB J databases. 

DR EMBL; AB04 9853; BAB1673 9.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 179 AA; 19949 MW; 5F8CD4383FEE9E02 CRC64 ; 



Query Match 14.4%; Score 844; DB 6; Length 179; 

Best Local Similarity 95.5%; Pred. No. 8.5e-36; 

Matches 171; Conservative 6; Mismatches 2; Indels 0; Gaps 0; 



Qy 985 IKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPF 1044 

^; M Ml II I 1 1 1 , 1 ' 1 1 1 II 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mi M I 1 1 1 II M I M 

Db 1 MKKTGVVFGASLFLLLSLTVFS I VS VTAYI ALALLS VTI SFRI YRGVI QAI QKSDEGHPF 60 

Qy 104 5 RAYLESEVAI SEELVQKYSNSALGHVNSTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I II I 

Db 61 RAYLESEVAI SEELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLRFAVLMWVFTYVGA 12 0 

Qy 1105 LFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAD 1163 

IIIMIIIIIIIIIIIhllllllll llllllllllhlllllllllllllllllll: 
Db 121 LFNGLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 179 

RESULT 11 
Q8K4S4 

ID Q8K4S4 PRELIMINARY; PRT; 780 AA. 

AC Q8K4S4; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel . 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Reticulon 1A. 

GN RTN1 OR RTN-1A. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 



OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ICR; TISSUE=Brain ; 

RA Hirata T. , Nomura T. # Takagi Y., Sato Y. , Tomioka N., Fujisawa H., 

RA Osumi N . ; 

RT "Mosaic development of the olfactory cortex with Pax6 -dependent and 

RT -independent components ; 

RL Submitted (NOV-2001) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AB074899; BAB96551.1; 

DR MGD; MGI: 1933947; Rtnl . 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HIST0NE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 780 AA; 83504 MW; 545F5638C576A069 CRC64 ; 



Query Match 13.5%; Score 792; DB 11; Length 780; 

Best Local Similarity 32.5%; Pred. No. 3e-32 ; 

Matches 242; Conservative 105; Mismatches 260; Indels 138; Gaps 27; 

Qy 517 EAAVSNM P EGLTPDLVQEACE S ELNEATG TKIAYETKVDLVQTSEAIQ-ESLY 568 

| | : | || : : | | : | : : | : | : :| |: : 

Db 76 ETASTGM- -AAVPDALDHSPSSTLKDGEGACYTSLISDVCYPPREDSAYFTGILQKENGH 133 

Qy 569 PTAQLCPSFEEAEATPSPVLPDIVMEAP LNS LLPSAGASWQPSVSPLEAPP 620 

I I II I II I lh: I hi : h I : ih 

Db 134 ITTSESP- -EEPE-TPGPSLPEVPGMEPQGLLSSDSGIEMTPAESTEVNKILADPLDQMK 190 

Qy 621 PVSYDSIKLE P EN P P P Y E EAMN VALKALGT K EG I KEPESFNAAV 664 

= 11= I I I : : I I I : lh = I I I 

Db 191 AEAYKYIDITRPQEAKGQEEQHPGLEDKDLDFKDKGTEVSTKAEGVRAPNQ-PAPVEGKL 249 

Qy 665 QETEAPYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKS VPEHAELV 712 

: | MM | :: ::| :| :|| ::::|| :| 

Db 250 I KDHLFEESTFAP YI DELSD- - EQHRVSLVTAPVKI TLTEI EPPLMTATQETI PEKQDLC 307 

Qy 713 EDSSPESEPVDLFSDDS I PEVPQTQEEAVMLMKESLTEVSETVAQHK EERLSASP 767 

II- I h h I II I =1 I U -I 

Db 3 08 LKPSPDTVPTVTVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIK 361 

Qy 768 QELGKPYLESFQPN -LHSTKDAASNDI PTLTKKEKISLQMEEFNTAI YSNDDLLSS 822 

= I I : I I = lh I II 

Db 3 62 EAKGLSYETTESPRPVGQVADKPKTKTRSGLPTIPS PLDQEASS 4 05 

Qy 823 KEDKIKESETFSDSSPIEI IDEFPT FVSAKDDSPKLAK-EYTDLEVSDKSEIAN 875 

I | | |: |: | |: M || :|: | ::|: 

Db 406 AESGDS E I EL VS E - D PMAS EDAL PSG YVS FGH VSG P P PS PAS PS I QYS I LRE EREAEL - - 462 

Qy 876 IQSGADSLPCLELPCDL-SFKNI YPKDEVH VSDEFSENRSSVSKASISPSNVS 927 

II =1 II I III III: :|h 

Db 463 DSELI IE-SCDASSASEESPKREQDSPPMKPGALDAIREETGSRATEERAPSHQG 516 



QY 



928 ALE PQTEMGS I VKS KS LTKEAEKKL 



PSDTEKEDRSLSAVLSAE 



970 



Db 


517 


PVEPD-PMLSFAPAAALQSRPEPSSGDGASVPEPPRSQQQKPEEEAVSSSQSPTATEIPG 


575 


Qy 


971 


LS KTS WDLL YWRD I KKTGWFGAS LFLLLS LTVF SI VS VTA Y I ALAL 


1018 


Db 


576 


H Mlllllllhlhllh 1 II III Ihlll Ihlll 
PLGSGLMPPLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAA 


635 


Qy 


1019 


LSVTI SFRI YKGVI QAI QKSDEGHPFRAYLESEVAI SEELVQKYSNSALGHVNSTI KELR 

II Mill! MMMIIUMII h hi :||h: 1 


1078 


Db 


636 


LSATISFRI YKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELR 


695 


Qy 


1079 


RLFLVDDLVDSLKFAVLMW VFTY VGALFNGLTLL I LAL I SLFS I PVI YERHQVQI DHYLG 


1138 


Db 


696 


Mill IIIIIMIIIIII: MMMIMMIhM-hhMhl Ml hi Ml 
RLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAWSMFTLPVVYVKHQAQVDQYLG 


755 


Qy 


1139 


LANKS VKDAMAKI QAKI PGLKRKAD 1163 
1 = MIIMIMI II h 




Db 


756 


L VRTH I NTWAKI QAKI PGAKRHAE 780 





RESULT 12 
Q8K0T0 

ID Q8K0T0 PRELIMINARY; PRT; 780 AA. 

AC Q8K0T0; 

DT 01-OCT-2 002 (TrEMBLrel . .22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to reticulon 1. 

GN RTN1 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBIJTaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Retina; 

RA Strausberg R. ; 

RL Submitted (MAY-2002) to the EMBL / GenBank / DDB J databases. 

DR EMBL; BC030455; AAH30455.1; -. 

DR MGD; MGI: 1933947; Rtnl . 

DR InterPro; IPR001951; Histone_H4 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HISTONE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 780 AA; 83572 MW; 2 9B47A58FC2F2 027 CRC64 ; 

Query Match 13.5%; Score 788; DB 11; Length 780; 

Best Local Similarity 32.4%; Pred. No. 4.9e-32; 

Matches 243; Conservative 104; Mismatches 256; Indels 146; Gaps 28; 

Qy 517 EAAVSNMPEGLTPDLVQEACESELNEATG TKIAYETKVDLVQTSEAIQ-ESLY 568 

I I : I II = : M : I : : I : I = =| h = 

Db 76 ETASTGM- -AAVPDALDHSPSSTLKDGEGACYTSLISDVCYPPREDSAYFTGILQKENGH 133 

Qy 569 PTAQLCPSFEEAEATPSPVLPDIVMEAP LNS LLPSAGASWQPSVSPLEAPP 620 

I I II I II I II- I hi : |: I : lh 

Db 134 1 TTSES P - - EEPE - TPGPSLPEVPGMEPQGLLSSDSGI EMTPAESTEVNKI LADPLDQMK 190 



Qy 621 PVSYDSI KLEPENPPPYEEA MNVALKALGTKEGI KEPESFNAAV 664 

: I I hi I h h II II- I I I 

Db 191 AEAYKYI DI TRPQEAKGQEEQHPGLEDKDLDFKDKDTEVSTKA EGVRAPNQ-PAPV 245 

Qy 665 QETEAPYISIACDLIKETKLSTEPSPDFSNYSEI AKFEKSVPEH 708 

Db 246 EGKLI KDHLFEESTFAPY I DELSD - - EQHRVSLVTAP VKI TLTE I E P PLMTATQETI PEK 303 

Qy 709 AELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHK EERL 763 

: I M- I 1= 1 = I II I H I =| : 

Db 3 04 QDLCLKPSPDTVPTVTVSE PEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELI 357 

Qy 764 SAS PQELGKPYL.ES FQPN LHSTKDAASNDI PTLTKKEKI SLQMEEFNTAI YSNDD 818 

= 1 = 11 = 1 I = =11= I 
Db 358 AAI KEAKGLSYETTESPRPVGQVADKPKTKTRSGLPTI PS PLDQ 401 

Qy 819 LLSSKEDKIKESETFSDSSPIEI IDEFPT FVSAKDDSPKLAK- EYTDLEVSDKS 871 

II I M I = I = 11= II II =|: I 

Db 402 EASSAESGDSEI ELVSE - DPMASEDALPSGYVSFGHVSGPPPS PAS PS I QYS I LREEREA 460 

Qy 872 EIANIQSGADSLPCLELPCDL-SFKNI YPKDEVH VSDEFSENRSSVSKASISP 923 

h II H II I III || | : :| 

Db 461 EL DSELI IE-SCDASSASEESPKREQDSPPMKPGALDAIREETGSRATEERAP 512 

Qy 924 SNVSALEPQTEMGSI VKSKSLTKEAEKKL PSDTEKEDRSLSAVLSAE 970 

h :|| I I : =| I | :::: || |:: 

Db 513 SHQGPVEPD-PMLSFAPAAALQSRPEPSSGDGASVPEPPRSQQQKPEEEAVSSSQSPTAT 571 

Qy 971 LSKTS WDLLYWRDI KKTGWFGASLFLLLSLTVFS I VSVTAYI 1014 

= 1 =lllllllll = ll = ll|: I II Ml IhlM 
Db 572 EI PGPLGSGLMPPLPFFNKQKAI DLLYWRDI KQTGI VFGSFLLLLFSLTQFS WSWAYL 631 

Qy 1015 ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 1074 

Ml 1 1 MINIM hlhlhlllllhllM 1= =1 = 1 =MI = : Mllh 

Db 632 ALAALSATI SFRI YKSVLQAVQKTDEGHPFKAYLELEI TLSQEQI QKYTDCLQLYVNSTL 691 

Qy 1075 KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFSI PVI YERHQVQI D 1134 

IMMIMI MIMMIMIM: I I I M I II I I I II = = h = I = I : M I = I =11 hi 
Db 692 KELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAWSMFTLPVWVKHQAQVD 751 

Qy 1135 HYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

MM = MMIIIMI II 1 = 

Db 752 QYLGLVRTHIN-TWAKIQAKI PGAKRHAE 78 0 



RESULT 13 
Q90638 

ID Q90638 PRELIMINARY; PRT; 760 AA. 

AC Q90638; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01 -NOV- 1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE ChS-Rex-b. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gall i formes; Phasianidae; Phasianinae; 



OC Gallus. 

OX NCB I JTax I D= 9 0 3 1 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=96386034; PubMed-8793864 ; 

RA Baka I.D., Ninkina N.N. , Pinon L'.G. , Adu J. , Davies A.M. , 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmental ization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol . Cell. Neurosci. 7:289-303(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Brain; 

RX MEDLINE=97183663 ; 

RA Ninkina N.N., Baka I.D., Buchman V.L.; 

RT "Rat and chicken s-rex/NSP mRNA : nucleotide sequence of main 

RT transcripts and expression of splice variants in rat tissues. "; 

RL Gene 184:205-210(1997). 

DR EMBL; U17606; AAC60075.1; -. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 760 AA; 82502 MW; 465C1B429F799D5C CRC64 ; 

Query Match 13.0%; Score 762.5; DB 13; Length 760; 

Best Local Similarity 27.1%; Pred. No. 9.3e-31; 

Matches 28 5; Conservative 124; Mismatches 319; Indels 323; Gaps 34; 

Qy 142 PPPP AG-ASPLAEPAAPPSTPAAPKRRGSGSVDETLFALPAASE- -PVIPSSAEK 193 

III II I I I I I | h I :| I II = = | 

Db 4 PPDPQDLLLAGTAERWAAAGADEYAAGAALRDGDGAQQREQLAFGSAREHPPVAMATA-- 61 

Qy 194 IMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYLGNLSAVSSSEGTI 253 

Db 62 SPGVTASSRLFDYGSSSANGADSSFYTSLISDVHY 96 

Qy 254 EETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSF- -KGSPKGESAILVENTKEEV 311 

II II h : I = I II :| : 
Db 97 TTP RDNTYFTGV- YQQENSPI PCSGSTEGFN 126 

Qy 312 I VRSKDKEDLVCSAALHS PQES P VGKEDRW SPEKTMDI FNEMQMSWAPVREEY 366 

Mi: |||: I I . |= |= | = = I 

Db 127 ALGHPVQDVTGFESRGLFSLDSGIEMTPAESAEVDKSLTDPMKVEG 172 

Qy 3 67 ADFKPFEQAWEVKDTYEGSRDV1AARANVESKVDRKCLEDSLEQKSLGKDSEGR 42 0 

: |:| |: | :|: : | | :| 

Db 173 YKYMDI SRPEE I K- - YQEKHD PDSEDESPDLIDEYRGTPIGSGHAAEPQRTTA 223 

Qy 421 NEDASFPSTPEPVKDSSRAYI TCASFTSATESTTANTFPLLEDHTSENKTDEKKI EERKA 480 

:| I : I : : I I II =| I I = II = = | 

Db 224 SEAIKAPKEQDPLEDKSFRDQHNASWTAPVKI TLTETPGAREATS KEA 272 

Qy 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTP DLVQEAC 53 6 

= 1= I 1 = : I II : h II = II I = = 

Db 273 SVTQPKSGLKPSHEWPTVMVSEPE DDSPGSVTPPSSGTEPSGSESQGKGSLSEDEL 329 



Qy 537 ESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDI VMEAP 596 

I = II I = = 11 II I h I |= I I I = 

Db 330 ISAIKEAKG- -FSFETSE- -VQQSPAV SAEKQEQKMKPGRPAV 368 

Qy 5 97 LNSLLPSAGAS WQPSVSPLEAPPPVSYDS - I KLEPENPPPYEEAMNVALKALGTKEGI K 655 

II II I II hi hi II :: I 

Db 369 PS PLDNEASSAESGDSE I ELVS EDPLAAEEVLHSNYMTFSH I GG - - 412 

Qy 656 EPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDS 715 

II M :| | : |: :||: :| 

Db 413 PPPS PASPSIQYS ILREEREAELDSELI IES 443 

Qy 716 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYL 775 

I II :|: III : - : : I III I I 
Db 444 CDAS SASEESPKREQDSPLMKPMVMDI 1 KEENSSRASASDYEASK 488 

Qy 776 ESFQPNLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSD 835 

I :: I : : :| :: 

Db 489 TTESRMN RENLADSASYLK 507 

Qy 836 SSPIEI IDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFK 895 

II i I =11.= = I 

Db 5 08 SS FV APKVSSE 518 

Qy 8 96 NIYPKDEVHVSDEFSEN RSSVSKASISPSNVSALEPQTEMGSI VKSKSLTKEAEKKL 952 

I ::| | : : : :: | || =| = = | = 

Db 519 PPTSAVSTEELKERI ILKKPIEETWNQSKVS SKDSGKRS 558 

Qy 953 PSDTEKEDRSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSA/TA 1012 

I h I hi : I II III Ihlll I 

Db 559 P LAL P LL P FLN KQ KA I NLL YWRD I KQTG I VFG S LLLLL F S LTQ F S WSWA 609 

Qy 1013 YIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNS 1072 

hill II llllllll hlhlhlllllhllh h = h = MM- =111 
Db 610 YLALAGLSATISFRIYKSVLQAVQKTDEGHPFKAYLDMEMNLSQDQIQKYTDCLQLYVNS 669 

Qy 1073 TI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LAL I SLFSI PVI YERHQVQ 1132 

MINIMI Mllllllllllh I II iri|||!lllhh:hh: lh|:::| I 

Db 670 TVKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLIMAWSMFTLPVVYDKYQAQ 72 9 

Qy 1133 I DHYLGLANKS VKDAMAKI QAKI PGLKRKAD 1163 

II I I I I = =111111111 lllh 

Db 730 IDQYLGLVRTHINTWAKIQAKI PGAKRKAE 760 



RESULT 14 
Q8CCU2 

ID Q8CCU2 PRELIMINARY; PRT; 643 AA. 

AC Q8CCU2; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last sequence update) 

DT 01-MAR-2 003 (TrEMBLrel. 23, Last annotation update) 

DE Reticulon 3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBI_TaxID=10090; 



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE-Medulla oblongata; 

RX MEDLINE=22354683 ; PubMed=12466851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK032109; BAC27708.1; -. 

SQ SEQUENCE 643 AA; 68769 MW; 15E3335B280533F8 CRC64 ; 



Query Match 12.0%; Score 700; DB 11; Length 643; 

Best Local Similarity 30.3%; Pred- No. l.le-27; 

Matches 227; Conservative 109; Mismatches 234; Indels 180; Gaps 30; 
Qy 442 TCA-SFTSATESTTANTFPLLEDHTSENKTDEKKIEERKAQI ITEKTSPKTSNPFLVAVQ 500 
Db 4 6 SCADSFVSSSSSQPVSIFSTSQGDWTEAFTEGKPVR 81 

Qy 5 01 DSEADYVTTDTLSKVTEAAVSNMP EGLTPDLVQEACESELNEATGTKIAYETKVD 555 

I I : I I || 
Db 82 DYLSSTKEAGGNGVPGSSQLHSELPGSMPEKWVSGSGAAT VEVT 125 

Qy 556 LVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSP 615 

I I h I I- I |: II I I II I : : 

Db 126 LPNLRGAWPNSVMGEVTEVDSSGESDDT VI EDIT-EKP- -DSLPSAAAKTSEREIK- 178 

Qy 616 LEAPPPVSYDSIKLE- -PENPPPYEEAMNVALKALGTKEGIKEPESFNAAVQE TEA 669 

I I I = - = i : I : |:| : : | : | | | 

Db 179 -ETP SRETVRSEMCENSEQPQAQPETPTQKSLEGEVASQVPNTLNEVTPEKLDMTNN 234 

Qy 670 PYISIAC- -DLIKETKLS-TEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESEPVDLFS 726 

Db 235 PKVCSAAPPSVLNETGFSLTVPAS AKLESLLGKYVEDTDGSSPE 278 

Qy 727 DDSI PEVPQTQEEAVMLMKES- -LTEVSETVAQHKEERLSASPQELGKPYLESFQPNLHS 784 

I : : =h =: =1 I I I =1 I " = Ml II 
Db 279 -DLMAVLTGAEEKGIVDKEEGDVLEAVLEKIADFK NTLPVEL LHE 322 

Qy 785 TKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIEIIDE 844 

II I I = h II I 
Db 323 SE LSGSETKNIKSKYSEDSR- -ETTGG 347 

Qy 845 FPTFVSAKDDSPKLAKEYTDLEV SDKSEIANIQSGADSLPCLELPCDLSFKN 896 

Db 348 APTM SPDLEQEQLTI RAI KELGERQAEKVQDEGI SSGG ' KLKQ 389 

Qy 897 I Y - PKDEVHVSDEFSENRSSVSKAS IS- PSNVSALEPQTEMGS I VKSKSLTKEAEKKLPS 954 

: I : I : h : : : | | : :: | : M I I II 
Db 390 TFAPQSGPQSSSDILEHTDVKTGSDLGI PKNPTI IK-NTRIDSI SSLTK 437 

Qy 955 DTEKEDRSLSAVLSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI 1014 

M — lh II I !|::|||:|||| Ml : I Mill li-ll :|: 
Db 438 -TEMVNKN VLARLLSDFPVHDLI FWRDVKKTGFVFGTTLI MLLSLAAFS VI SWSYL 4 93 



Qy 1015 ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 1074 



Illlllllllhll IIIMIHIMHIh - :| I I hh III 

LALLSVTI SFRVYKSVIQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHVNKAL 553 



Db 


494 


I LALLS VT I S F R VYKS VI QAVQKS E EGH P FKAYLDVD I TLS S EAFHN YMNAAMVH VNKAL 


553 


Qy 


1075 


KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI LALI SLFS I P VI YERHQVQ I D 

I = lllhl MM II Ih MMMMM MM = MMMMM- III 

KL 1 1 RLFLVEDLVDSLKLAVFMWLMTYVGAVFNG I TLL I LAELLVFS VP I VYEKYKTQ I D 


1134 


Db 


554 


613 


Qy 


1135 


HYLGLANKS VKDAMAKI QAKI PGL - KRKAD 1163 




Db 


614 


11 = 1 = 1 1 = 11111 = 11= 1 = 11 = 
H Y VG I ARDQTKS I VE KI QAKLPG I AKKKAE 643 





RESULT 15 
Q9BQ59 

ID Q9BQ59 PRELIMINARY ; PRT; 199 AA. 

AC Q9BQ59; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to reticulon 1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung ; 

RA Strausberg R. ; 

RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC003003; AAH03003.1; -. 

DR EMBL; BC000314; AAH00314.1; -. 



DR 


InterPro ; 


; IPR001951; Histone_H4 . 




DR 


InterPro, 


; IPR003388; Reticulon. 




DR 


Pfam; PF02453; Reticulon; 1. 




DR 


PROSITE; 


PS 00 04 7; HISTONE_H4; 1. 




DR 


PROSITE; 


PS5084 5; RETICULON; 1. 




SQ 


SEQUENCE 


199 AA; 22642 MW; 7CFA44CC568DF6D8 CRC64 ; 




Query Match 


11.7%; Score 685; DB 4; Length 199; 




Best Local Similarity 67.0%; Pred. No. 1.2e-27; 




Matches 12E 


i; Conservative 32; Mismatches 31; Indels 0; Gaps 


0; 


Qy 


973 


KTSVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 


1032 


Db 


9 


h =111111111 = 11 = 111= 1 II III Ihlll Ihlll II llllllll h 

KSQAI DLLYWRD I KQTG I VFGS FLLLLFSLTQFS WS WAYLALAALSAT ISFRI YKSVL 


68 


Qy 


1033 


QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 
IHhIIIIIHIII h :|:| 1 h M 1 I I! 1 


1092 


Db 


69 


QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 


128 


Qy 


1093 


AVLMWVFTY VGALFNGLTLL I LAL I SLFS I PVI YERHQVQ I DHYLGLANKS VKDAMAKI Q 


1152 


Db 


129 


11111= Mllll!lllll|::|::|:|::|hl =11 III MM : MIM 
AVLMWLLT YVGALFNGLTLLLMAWSMFTLP WYVKHQAQI DQ YLGLVRTH I NAWAKI Q 


188 


Qy 


1153 


AKI PGLKRKAD 1163 




Db 


189 


MIM II h 
AKIPGAKRHAE 199 





Search completed: December 19, 2003, 15:36:42 
Job time : 51 sees 



